AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

EvalEvalBot 
in evaleval/EEE_datastore about 3 hours ago

Add HELM AIR-Bench v1.16.0 results

1
#70 opened about 3 hours ago by
yifanmai