Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 8 days ago • 154
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published Dec 15, 2025 • 64
Glance: Accelerating Diffusion Models with 1 Sample Paper • 2512.02899 • Published Dec 2, 2025 • 30
Computer-Use Agents as Judges for Generative User Interface Paper • 2511.15567 • Published Nov 19, 2025 • 53
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Paper • 2511.02778 • Published Nov 4, 2025 • 102
Paper2Video: Automatic Video Generation from Scientific Papers Paper • 2510.05096 • Published Oct 6, 2025 • 119