text-to-speech
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
• 2404.14700
• Published
• 32
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper
• 2306.15687
• Published
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Paper
• 2403.03100
• Published
• 38
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper
• 2404.09956
• Published
• 12
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Paper
• 2307.07218
• Published
• 28
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Paper
• 2306.03509
• Published
• 5
parler-tts/dac_44khZ_8kbps
76.7M • Updated
• 110
• 19
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
• 0.6B • Updated
• 3.99k
• 358
Wenetspeech4TTS/WenetSpeech4TTS
Updated
• 558
• 85
Text-to-Audio
• Updated
• 2
• 9
Feature Extraction
• 96.2M • Updated
• 540k
• 289
Text-to-Speech
• Updated
• 10.1M
• 5.79k
Text-to-Speech
• 4B • Updated
• 1.14k
• 525
Text-to-Speech
• Updated
• 3.3k
• 1.1k
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
• 4B • Updated
• 33
• 196
Text-to-Speech
• Updated
• 86
• 414
Text-to-Speech
• Updated
• 83.3k
• 2.83k