Papers - Audio - TTS
updated
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram
Predictions
Paper
• 1712.05884
• Published
• 3
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Paper
• 2403.16973
• Published
• 3
High Fidelity Neural Audio Compression
Paper
• 2210.13438
• Published
• 4
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting
for Text-to-Speech Synthesis
Paper
• 2404.03204
• Published
• 10
Qwen-Audio: Advancing Universal Audio Understanding via Unified
Large-Scale Audio-Language Models
Paper
• 2311.07919
• Published
• 10
Text-to-Speech
• Updated
• 65.8k
• 1.51k
Natural language guidance of high-fidelity text-to-speech with synthetic
annotations
Paper
• 2402.01912
• Published
• 13
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow
Matching
Paper
• 2410.06885
• Published
• 46
Matcha-TTS: A fast TTS architecture with conditional flow matching
Paper
• 2309.03199
• Published
• 15
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
Paper
• 2308.11466
• Published
• 1