Kyle PRO
iky1e
AI & ML interests
None yet
Recent Activity
liked a model 4 days ago
mlx-community/Mega-ASR-8bit liked a model 4 days ago
HKUSTAudio/Talker-T2AVOrganizations
Micro-LLM
Gender Detection
-
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 74.8k • 47 -
audeering/wav2vec2-large-robust-24-ft-age-gender
Audio Classification • 0.3B • Updated • 1.89M • 53 -
JaesungHuh/voice-gender-classifier
Audio Classification • 15.5M • Updated • 22.9k • 34
Embedding Models
Audio Analysis
-
codelion/whisper-age-estimator
Automatic Speech Recognition • 72.6M • Updated • 196 • 3 -
blackhole33/uzbek-speaker-verification-v4
Updated • 75 • 1 -
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 74.8k • 47 -
fronx/Fast-FullSubNet
Audio-to-Audio • Updated • 5
Text to Speech
-
parler-tts/parler-tts-mini-expresso
Text-to-Speech • 0.6B • Updated • 1.35k • 117 -
parler-tts/parler-tts-large-v1
Text-to-Speech • 2B • Updated • 11.1k • 273 -
parler-tts/parler-tts-mini-v1
Text-to-Speech • 0.9B • Updated • 28.3k • 153 -
OuteAI/OuteTTS-0.2-500M
Text-to-Speech • 0.5B • Updated • 245 • 310
Music Models
Open Datasets
Language Models
Interesting LLM Models
-
mzbac/llama-3-8B-Instruct-function-calling
Text Generation • 8B • Updated • 9 • • 30 -
hjhj3168/Llama-3-8b-Orthogonalized-exl2
Text Generation • Updated • 73 • 91 -
failspy/kappa-3-phi-abliterated
Text Generation • 4B • Updated • 23 • • 46 -
failspy/kappa-3-phi-3-4k-instruct-abliterated-GGUF
4B • Updated • 89 • 12
DeepFilterNet-MLX
MLX ports of the DeepFilterNet speech enhancement models for Apple Silicon
Audio Encoder
3D
- Configuration errorAgentsFeatured4.78k
TRELLIS
🏢4.78kScalable and Versatile 3D Generation from images
- Running on ZeroAgents191
PSHuman
🏃191PHOTOREALISTIC HUMAN RECONSTRUCTION w/ CROSS-SCALE DIFF
-
microsoft/TRELLIS-image-large
Image-to-3D • Updated • 1.95M • 648 - Runtime errorAgentsFeatured90
GaussianAnything-AIGC3D
🌖90Generate 3D models from 2D images
Speech to Text
-
UsefulSensors/moonshine
Automatic Speech Recognition • Updated • 94 -
UsefulSensors/moonshine-tiny
Automatic Speech Recognition • 27.1M • Updated • 27.8k • 37 -
UsefulSensors/moonshine-base
Automatic Speech Recognition • 61.5M • Updated • 9.29k • 44 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.34M • • 5.74k
Text to Image
structured information extraction
Translation
Video Models
Audio Models
Image Models
Super-Resolution
DeepFilterNet-MLX
MLX ports of the DeepFilterNet speech enhancement models for Apple Silicon
Micro-LLM
Audio Encoder
Gender Detection
-
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 74.8k • 47 -
audeering/wav2vec2-large-robust-24-ft-age-gender
Audio Classification • 0.3B • Updated • 1.89M • 53 -
JaesungHuh/voice-gender-classifier
Audio Classification • 15.5M • Updated • 22.9k • 34
3D
- Configuration errorAgentsFeatured4.78k
TRELLIS
🏢4.78kScalable and Versatile 3D Generation from images
- Running on ZeroAgents191
PSHuman
🏃191PHOTOREALISTIC HUMAN RECONSTRUCTION w/ CROSS-SCALE DIFF
-
microsoft/TRELLIS-image-large
Image-to-3D • Updated • 1.95M • 648 - Runtime errorAgentsFeatured90
GaussianAnything-AIGC3D
🌖90Generate 3D models from 2D images
Embedding Models
Speech to Text
-
UsefulSensors/moonshine
Automatic Speech Recognition • Updated • 94 -
UsefulSensors/moonshine-tiny
Automatic Speech Recognition • 27.1M • Updated • 27.8k • 37 -
UsefulSensors/moonshine-base
Automatic Speech Recognition • 61.5M • Updated • 9.29k • 44 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 5.34M • • 5.74k
Audio Analysis
-
codelion/whisper-age-estimator
Automatic Speech Recognition • 72.6M • Updated • 196 • 3 -
blackhole33/uzbek-speaker-verification-v4
Updated • 75 • 1 -
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
Audio Classification • 0.3B • Updated • 74.8k • 47 -
fronx/Fast-FullSubNet
Audio-to-Audio • Updated • 5
Text to Image
Text to Speech
-
parler-tts/parler-tts-mini-expresso
Text-to-Speech • 0.6B • Updated • 1.35k • 117 -
parler-tts/parler-tts-large-v1
Text-to-Speech • 2B • Updated • 11.1k • 273 -
parler-tts/parler-tts-mini-v1
Text-to-Speech • 0.9B • Updated • 28.3k • 153 -
OuteAI/OuteTTS-0.2-500M
Text-to-Speech • 0.5B • Updated • 245 • 310
structured information extraction
Music Models
Translation
Open Datasets
Video Models
Language Models
Audio Models
Interesting LLM Models
-
mzbac/llama-3-8B-Instruct-function-calling
Text Generation • 8B • Updated • 9 • • 30 -
hjhj3168/Llama-3-8b-Orthogonalized-exl2
Text Generation • Updated • 73 • 91 -
failspy/kappa-3-phi-abliterated
Text Generation • 4B • Updated • 23 • • 46 -
failspy/kappa-3-phi-3-4k-instruct-abliterated-GGUF
4B • Updated • 89 • 12
Image Models