RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 7 days ago • 100
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 11 days ago • 238
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 11 days ago • 42
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 18 days ago • 481
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Paper • 2603.28713 • Published 21 days ago • 20
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding Paper • 2603.27593 • Published 22 days ago • 12
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation Paper • 2603.29029 • Published 21 days ago • 13
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248