Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published Jan 5 • 108
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 183
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 15 days ago • 51
view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 27 days ago • 64
Nested Learning: The Illusion of Deep Learning Architectures Paper • 2512.24695 • Published Dec 31, 2025 • 43
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 • 64
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published Jan 29, 2025 • 59
Kandinsky 5.0 Video Lite Collection Kandinsky 5.0 Video Lite is a lightweight 2B model that generates up to 10-second SD videos from English and Russian prompts with high visual quality. • 9 items • Updated Dec 14, 2025 • 13
Kandinsky 5.0 Video Lite Diffusers Collection Kandinsky 5.0 Video Lite is a lightweight 2B model that generates up to 10-second SD videos from English and Russian prompts with high visual quality. • 8 items • Updated 1 day ago • 5
Kandinsky 5.0 Video Pro Diffusers Collection Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 4 items • Updated Dec 14, 2025 • 11
Kandinsky 5.0 Video Pro Collection Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 5 items • Updated Dec 14, 2025 • 16
Kandinsky 5.0 Image Lite Collection Kandinsky 5.0 Image Lite is a 6B DiT-based model that generates and edits HD images from English and Russian text prompts with high visual quality. • 4 items • Updated Dec 14, 2025 • 18
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism Paper • 2511.11373 • Published Nov 14, 2025 • 13