VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design Paper • 2307.16430 • Published Jul 31, 2023 • 4
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System Paper • 2502.05512 • Published Feb 8, 2025 • 7
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction Paper • 2502.11946 • Published Feb 17, 2025 • 3
Running on Zero Featured 2.84k F5-TTS 🗣 2.84k F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Paper • 2410.06885 • Published Oct 9, 2024 • 47
EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis Paper • 2308.05725 • Published Aug 10, 2023 • 2