Active filters: vLLM
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 49.4k
• 12
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 96.8k
• 7
QuantTrio/MiniMax-M2.7-AWQ
Text Generation
• 229B • Updated • 83
• 4
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 390k
• 43
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 30.4k
• 61
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 84.3k
• 357
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 297
• 46
Text Generation
• 754B • Updated • 2
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 691k
• 42
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 88.1k
• 12
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 89.9k
• 15
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 154k
• 18
Text Generation
• 586B • Updated • 4.99k
• 6
unsloth/Mistral-Small-4-119B-2603
119B • Updated • 175
• 4
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 12.9k
• 7
Xingyu-Zheng/Qwen3.5-9B-GLM5.1-Distill-v1-INT4-FOEM
Image-Text-to-Text
• 9B • Updated • 20
• 1
Xingyu-Zheng/Qwopus3.5-27B-v3.5-INT4-FOEM
Image-Text-to-Text
• 27B • Updated • 559
• 1
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 129
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 7
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 100
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 73
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 6
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 132
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 256
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 86
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 9
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 130
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 10
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 27.9k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 253
• 4