A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14, 2025 • 70
Bringing Fusion Down to Earth: ML for Stellarator Optimization Article • Jul 2, 2025 • 77
The Ultra-Scale Playbook Running • 3.62k • The ultimate guide to training LLM on large GPU Clusters
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Article • May 21, 2025 • 38
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • 33B • Updated Jan 12, 2025 • 182k • 1.96k
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18, 2025 • 144