Mnemis: Dual-Route Retrieval on Hierarchical Graphs for Long-Term LLM Memory Paper • 2602.15313 • Published 3 days ago • 2
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 11 days ago • 40
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 11 days ago • 40
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 18 days ago • 32
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30, 2025 • 29
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 18 days ago • 32
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 18 days ago • 32
SIGMA: An AI-Empowered Training Stack on Early-Life Hardware Paper • 2512.13488 • Published Dec 15, 2025
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 154
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data Paper • 2510.25804 • Published Oct 29, 2025 • 1
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data Paper • 2510.25804 • Published Oct 29, 2025 • 1 • 1
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data Paper • 2510.25804 • Published Oct 29, 2025 • 1
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection Paper • 2510.18909 • Published Oct 21, 2025 • 5
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection Paper • 2510.18909 • Published Oct 21, 2025 • 5