Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 6 days ago • 36
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 4 days ago • 70
ObjectClear: Complete Object Removal via Object-Effect Attention Paper • 2505.22636 • Published May 28, 2025 • 5
Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation Paper • 2604.25819 • Published 7 days ago • 16
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published 14 days ago • 28
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published Dec 24, 2025 • 32
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published 12 days ago • 24
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published 12 days ago • 19
ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis Paper • 2604.19720 • Published 14 days ago • 3
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published 15 days ago • 97
UniMesh: Unifying 3D Mesh Understanding and Generation Paper • 2604.17472 • Published 16 days ago • 11
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 14 days ago • 249
SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing Paper • 2604.19587 • Published 14 days ago • 46
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 14 days ago • 87
Speculative Decoding for Autoregressive Video Generation Paper • 2604.17397 • Published 16 days ago • 11
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published 14 days ago • 38
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Paper • 2603.25745 • Published Mar 26 • 16
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories Paper • 2604.15311 • Published 19 days ago • 12
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 20 days ago • 117