-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 31 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 28 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper • 2401.00604 • Published • 6 -
LARP: Language-Agent Role Play for Open-World Games
Paper • 2312.17653 • Published • 33
Collections
Discover the best community collections!
Collections including paper arxiv:2312.03047
-
GFPGAN
😁1.1kEnhance and restore old photos and AI-generated faces
-
MagicStick: Controllable Video Editing via Control Handle Transformations
Paper • 2312.03047 • Published • 11 -
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper • 2312.04461 • Published • 62 -
CutLER
🌖35Upload an image to enhance its details
-
Make Pixels Dance: High-Dynamic Video Generation
Paper • 2311.10982 • Published • 68 -
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Paper • 2311.10709 • Published • 25 -
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Paper • 2311.11243 • Published • 16 -
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper • 2312.02928 • Published • 18
-
DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models
Paper • 2312.05107 • Published • 38 -
Customizing Motion in Text-to-Video Diffusion Models
Paper • 2312.04966 • Published • 11 -
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Paper • 2312.04410 • Published • 15 -
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper • 2312.03793 • Published • 18
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 31 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 28 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper • 2401.00604 • Published • 6 -
LARP: Language-Agent Role Play for Open-World Games
Paper • 2312.17653 • Published • 33
-
DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models
Paper • 2312.05107 • Published • 38 -
Customizing Motion in Text-to-Video Diffusion Models
Paper • 2312.04966 • Published • 11 -
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Paper • 2312.04410 • Published • 15 -
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper • 2312.03793 • Published • 18
-
GFPGAN
😁1.1kEnhance and restore old photos and AI-generated faces
-
MagicStick: Controllable Video Editing via Control Handle Transformations
Paper • 2312.03047 • Published • 11 -
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Paper • 2312.04461 • Published • 62 -
CutLER
🌖35Upload an image to enhance its details
-
Make Pixels Dance: High-Dynamic Video Generation
Paper • 2311.10982 • Published • 68 -
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Paper • 2311.10709 • Published • 25 -
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Paper • 2311.11243 • Published • 16 -
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper • 2312.02928 • Published • 18