view article Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts 17 days ago • 10
Speculative Decoding for Autoregressive Video Generation Paper • 2604.17397 • Published 12 days ago • 11
view article Article How I contributed a new model to the Transformers library using Codex Mar 30 • 50
view article Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 Mar 5 • 51
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Paper • 2603.18742 • Published Mar 19 • 10
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA Paper • 2603.10256 • Published Mar 10 • 23
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers Paper • 2511.11062 • Published Nov 14, 2025 • 33
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125