Running on Zero • Featured • 129 • Qwen3-ASR Demo • Transcribe audio to text with multi-language timestamps
Running on Zero • Featured • 1.76k • Dia 1.6B • Generate realistic dialogue from a script, using Dia!
pyannote/speaker-diarization-3.1 • Automatic Speech Recognition • Updated May 10, 2024 • 10.6M • 1.74k
Running on Zero • Featured • 2.08k • PuLID-FLUX • Generate custom images from text and a reference photo
Running on CPU Upgrade • 1.01k • Open VLM Leaderboard • VLMEvalKit Evaluation Results Collection
MattyB95/AST-VoxCelebSpoof-Synthetic-Voice-Detection • Audio Classification • 86.2M • Updated Jan 31, 2024 • 121k • 4
Running on Zero • Featured • 5.05k • FLUX.1 [Schnell] • Generate images from text prompts with FLUX.1 Schnell
Running on L4 • Featured • 725 • StyleTTS 2 • Efficient, fast, and natural text to speech with StyleTTS 2!
Configuration error • Featured • 178 • NaturalSpeech3 FACodec • Convert and reconstruct speech files