BlurDM: A Blur Diffusion Model for Image Deblurring • Paper • 2512.03979 • Published 29 days ago
VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models • Paper • 2511.07299 • Published Nov 10, 2025
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning • Paper • 2510.15110 • Published Oct 16, 2025
TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control • Paper • 2510.09561 • Published Oct 10, 2025
Temporal Prompting Matters: Rethinking Referring Video Object Segmentation • Paper • 2510.07319 • Published Oct 8, 2025
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models • Paper • 2510.03232 • Published Oct 3, 2025
V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts • Paper • 2509.18053 • Published Sep 22, 2025
Autoregressive Universal Video Segmentation Model • Paper • 2508.19242 • Published Aug 26, 2025
V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models • Paper • 2502.09980 • Published Feb 14, 2025
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks • Paper • 2501.08326 • Published Jan 14, 2025
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation • Paper • 2410.21271 • Published Oct 28, 2024