DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation Paper • 2604.20841 • Published 12 days ago • 24
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 12 days ago • 239
DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data Paper • 2604.19859 • Published 13 days ago • 51
HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System Paper • 2604.14125 • Published 19 days ago • 20
ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack Paper • 2509.25843 • Published 20 days ago • 19
GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens Paper • 2604.15284 • Published 18 days ago • 24
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published Mar 23 • 34
RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework Paper • 2604.15308 • Published 18 days ago • 29
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published 18 days ago • 36
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 19 days ago • 117
GR00T-N1.7 Collection NVIDIA Isaac GR00T N1.7 open vision-language-action (VLA) model for generalized humanoid • 5 items • Updated 13 days ago • 9
view article Article NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots 16 days ago • 13
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Paper • 2604.11297 • Published 21 days ago • 141
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published Mar 25 • 125
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 25 days ago • 49
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 25 days ago • 261
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published 25 days ago • 100
Automating Database-Native Function Code Synthesis with LLMs Paper • 2604.06231 • Published Apr 2 • 17
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 26 days ago • 323