DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation Paper • 2601.22904 • Published 8 days ago • 13
Fork-Merge Decoding: Enhancing Multimodal Understanding in Audio-Visual Large Language Models Paper • 2505.20873 • Published May 27, 2025 • 9
Optical-Flow Guided Prompt Optimization for Coherent Video Generation Paper • 2411.15540 • Published Nov 23, 2024
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs Paper • 2505.19075 • Published May 25, 2025 • 21
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published May 24, 2025 • 48
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs Paper • 2505.19075 • Published May 25, 2025 • 21
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs Paper • 2505.19075 • Published May 25, 2025 • 21 • 2
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model Paper • 2503.22622 • Published Mar 28, 2025 • 18
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity Paper • 2503.07677 • Published Mar 10, 2025 • 86
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Paper • 2503.09151 • Published Mar 12, 2025 • 32
Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Paper • 2411.17041 • Published Nov 26, 2024 • 13
Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Paper • 2411.17041 • Published Nov 26, 2024 • 13 • 2
Free$^2$Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Paper • 2411.17041 • Published Nov 26, 2024 • 13
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation Paper • 2410.05591 • Published Oct 8, 2024 • 13
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler Paper • 2410.05651 • Published Oct 8, 2024 • 12
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide Paper • 2410.04364 • Published Oct 6, 2024 • 29