Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published 5 days ago • 32
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models Paper • 2512.19686 • Published Dec 22, 2025
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction Paper • 2602.13294 • Published 24 days ago • 13
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36 • 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36 • 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36 • 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36 • 7
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 27 days ago • 36
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands Paper • 2512.24965 • Published Dec 31, 2025 • 42
GARDO: Reinforcing Diffusion Models without Reward Hacking Paper • 2512.24138 • Published Dec 30, 2025 • 29
MoCha Collection The pioneering work in Dialogue-driven Movie Shot Generation • 4 items • Updated Dec 27, 2025 • 2