MotionEdit: Benchmarking and Learning Motion-Centric Image Editing Paper • 2512.10284 • Published 26 days ago • 25
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Paper • 2511.21662 • Published Nov 26, 2025 • 11
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published Nov 20, 2025 • 108
VisPlay: Self-Evolving Vision-Language Models from Images Paper • 2511.15661 • Published Nov 19, 2025 • 42
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10, 2025 • 6 • 4
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10, 2025 • 6
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10, 2025 • 6
StatEval: A Comprehensive Benchmark for Large Language Models in Statistics Paper • 2510.09517 • Published Oct 10, 2025 • 6 • 4
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29, 2025 • 141
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper • 2510.01444 • Published Oct 1, 2025 • 19
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper • 2510.01444 • Published Oct 1, 2025 • 19
CLUE: Non-parametric Verification from Experience via Hidden-State Clustering Paper • 2510.01591 • Published Oct 2, 2025 • 27
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models Paper • 2509.06949 • Published Sep 8, 2025 • 55
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning Paper • 2509.09674 • Published Sep 11, 2025 • 80
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation Paper • 2509.15194 • Published Sep 18, 2025 • 33
Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models Paper • 2509.12132 • Published Sep 15, 2025 • 6
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11, 2025 • 28
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 101