Understanding Generalization in Role-Playing Models via Information Theory Paper • 2512.17270 • Published 13 days ago • 1
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 17 days ago • 103
MOA: Multi-Objective Alignment for Role-Playing Agents Paper • 2512.09756 • Published 22 days ago • 3
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published about 1 month ago • 93
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 84
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 111
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 217
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 30 days ago • 242
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21, 2025 • 110
The Ultra-Scale Playbook 🌌: The ultimate guide to training LLM on large GPU Clusters • Running • 3.61k
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 3.61M • 931