Hugging Face
drwlf's Collections
Robotron
Robocop
Medra
Claria
Evaluation
MedImaging
Spaces
Psycho
Reasoning
Medical Data
Audiophile
Datasets
Evaluation
Updated Aug 27, 2025
microsoft/MMLU-CF • Viewer • Updated Jan 8, 2025 • 20.1k • 2.67k • 17
microsoft/Taskbench • Viewer • Updated Aug 21, 2024 • 17.3k • 1.47k • 35
AdaptLLM/biomed-VQA-benchmark • Viewer • Updated Aug 21, 2025 • 10.2k • 320 • 6
openai/healthbench • Preview • Updated Aug 27, 2025 • 3.44k • 142