Model Merging in Pre-training of Large Language Models • Paper 2505.12082 • Published May 17, 2025
DAPO: An Open-Source LLM Reinforcement Learning System at Scale • Paper 2503.14476 • Published Mar 18, 2025