Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
JoyFusion
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Unsloth
Pi
Inference Providers
Select all
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
quark
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
Merge
custom_code
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results
Apply filters
Models
150
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
quark
Clear all
amd/DeepSeek-OCR-MXFP4
2B
•
Updated
Feb 6
•
7
amd/Kimi-K2.5-MXFP4
550B
•
Updated
20 days ago
•
64.2k
•
2
amd/Qwen3-Coder-Next-MXFP4
41B
•
Updated
Feb 3
•
470
•
4
amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-6.0MLPerf-GPTQ
35B
•
Updated
Feb 5
•
21
amd/gpt-oss-120b-w-mxfp4-a-fp8-Mlperf
59B
•
Updated
Feb 6
•
79
amd/GLM-5-MXFP4
408B
•
Updated
12 days ago
•
11.1k
•
3
amd/MiniMax-M2.5-MXFP4
Text Generation
•
116B
•
Updated
11 days ago
•
7.08k
raikonenamd/gpt-oss-120b-w-mxfp4-a-fp8-kv-fp8-fp8attn
116B
•
Updated
Feb 26
•
1
RyzenAI/Qwen3-VL-4B-Instruct-per-grp-quant
Text Generation
•
1.0B
•
Updated
Feb 27
•
4
amd/Qwen3.5-397B-A17B-MXFP4
Image-Text-to-Text
•
222B
•
Updated
6 days ago
•
15.4k
•
3
ichbinblau/DeepSeek-R1-0528-MXFP4
350B
•
Updated
Mar 4
•
5
amd-quark/Qwen3-30B-A3B-nvfp4-quark
16B
•
Updated
12 days ago
•
58
amd/Step-3.5-Flash-MXFP4
Text Generation
•
102B
•
Updated
11 days ago
•
210
nameistoken/MiniMax-M2.1-Quark-W8A8-INT8
Text Generation
•
229B
•
Updated
Mar 8
•
3
ziliangpeng/DeepSeek-V3-Quark-MXFP4-v4-w4a6
365B
•
Updated
Mar 12
•
56
amd/Qwen3.5-35B-A3B-MXFP4
Image-Text-to-Text
•
21B
•
Updated
19 days ago
•
962
•
1
ziliangpeng/DeepSeek-V3-Quark-MXFP4-v3-w4a4
336B
•
Updated
Mar 20
•
337
amd/DeepSeek-R1-0528-MXFP4-MTP-MoEFP4
350B
•
Updated
21 days ago
•
4.03k
ginsongsong/eagle3-kimik2.5-w4a8
2B
•
Updated
27 days ago
•
50
ginsongsong/Kimi-K2.5-W8A8
1T
•
Updated
26 days ago
•
29
tbmod/Kimi-K2.5-MXFP4-mini
30B
•
Updated
9 days ago
•
1.76k
vmiss33/Qwen1.5-110B-Chat-FP8
111B
•
Updated
26 days ago
•
14
m8than/DeepSeek-V3.2-ptpc-fp8
686B
•
Updated
24 days ago
•
455
nameistoken/tiny-qwen3-moe-w8a8-int8-quark
Updated
24 days ago
•
3.99k
nameistoken/MiniMax-M2.5-Quark-W8A8-INT8
Text Generation
•
229B
•
Updated
21 days ago
•
53
amd/gpt-oss-20b-w-mxfp4-a-bf16
12B
•
Updated
20 days ago
•
695
zhuzhubenenzhu/MiniMax-M2.5-MXFP4
Text Generation
•
116B
•
Updated
11 days ago
•
226
nameistoken/MiniMax-M2.7-Quark-W8A8-INT8
Text Generation
•
229B
•
Updated
8 days ago
•
41
wafer-ai/glm-5-mxfp4-mtp-finetuned
Updated
1 day ago
•
14
amd/gpt-oss-120b-w-mxfp4-a-fp8-kv-fp8-fp8attn-no_lmhead_router
59B
•
Updated
about 7 hours ago
Previous
1
...
3
4
5
Next