Hugging Face
MarioBarbeque's Collections
Catastrophic Forgetting in Mathematical Reasoning
Code Generation
Finetuning
Mathematics
Finetuning
Updated Dec 2, 2025
Models to fine-tune (and datasets to fine-tune with) in future projects
Upvote
1
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF • Text Generation • 71B • Updated Apr 13, 2025 • 11k • 2.06k
FacebookAI/roberta-base • Fill-Mask • 0.1B • Updated Feb 19, 2024 • 14.3M • 574
openai-community/gpt2 • Text Generation • 0.1B • Updated Feb 19, 2024 • 11.8M • 3.14k
google/gemma-2-9b • Text Generation • Updated Aug 7, 2024 • 52.1k • 695
google/gemma-2-2b • Text Generation • Updated Aug 7, 2024 • 433k • 634
google/gemma-2-2b-it • Text Generation • 3B • Updated Aug 27, 2024 • 412k • 1.31k
google/gemma-1.1-2b-it • Text Generation • 3B • Updated Jun 27, 2024 • 83.3k • 172
nvidia/HelpSteer2 • Viewer • Updated Dec 18, 2024 • 21.4k • 7.8k • 441
HuggingFaceH4/no_robots • Viewer • Updated Apr 18, 2024 • 10k • 7.91k • 538
cais/mmlu • Viewer • Updated Mar 8, 2024 • 231k • 395k • 689
EleutherAI/gpt-j-6b • Text Generation • Updated Jun 21, 2023 • 117k • 1.52k
google/flan-t5-large • 0.8B • Updated Jul 17, 2023 • 385k • 877
deepseek-ai/DeepSeek-R1 • Text Generation • 685B • Updated Mar 27, 2025 • 1.87M • 13.1k
google/flan-t5-small • 77M • Updated Oct 10, 2023 • 589k • 469
google/flan-t5-base • Updated Jul 17, 2023 • 1.16M • 1.06k