Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Mehul Damani
PRO
mehuldamani
Follow
John6666's profile picture
wjurayj's profile picture
Spechawk's profile picture
3 followers
·
0 following
https://damanimehul.github.io
MehulDamani2
damanimehul
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a model
about 4 hours ago
mehuldamani/story-gen_llama-sft-partial
published
a model
about 4 hours ago
mehuldamani/story-gen_llama-sft-partial
updated
a model
about 4 hours ago
mehuldamani/story-gen_llama-sft-full
View all activity
Organizations
None yet
mehuldamani
's models
267
Sort: Recently updated
mehuldamani/story-gen_llama-sft-partial
Text Generation
•
8B
•
Updated
about 4 hours ago
mehuldamani/story-gen_llama-sft-full
Text Generation
•
8B
•
Updated
about 4 hours ago
mehuldamani/bug_fixing_new-rl-token-edit
Text Generation
•
8B
•
Updated
about 5 hours ago
mehuldamani/bug_fixing_new-arl-multiply
Text Generation
•
8B
•
Updated
about 5 hours ago
mehuldamani/bug_fixing_new-arl-add_multiply
Text Generation
•
8B
•
Updated
2 days ago
•
137
mehuldamani/bug_fixing_rlvr-7b-nokl-v2
Text Generation
•
8B
•
Updated
2 days ago
•
144
mehuldamani/olmo-sft-v1
Text Generation
•
7B
•
Updated
2 days ago
•
180
mehuldamani/countdown_arl-sft-add_multiply-v8
Text Generation
•
3B
•
Updated
4 days ago
•
221
mehuldamani/countdown_arl-sft-multiply-v8
Text Generation
•
3B
•
Updated
4 days ago
•
228
mehuldamani/countdown_rlvr-v6-high-corrupt
Text Generation
•
3B
•
Updated
4 days ago
•
236
mehuldamani/countdown_rlvr-v6-high-corrupt-gold
Text Generation
•
3B
•
Updated
4 days ago
•
230
mehuldamani/bug_fixing_sft-v1
Text Generation
•
8B
•
Updated
6 days ago
•
270
mehuldamani/code_gen_arl-ast-addmultiply-7b-v1
Text Generation
•
8B
•
Updated
8 days ago
•
339
mehuldamani/code_gen_rlvr-ast-7b-v2
Text Generation
•
8B
•
Updated
8 days ago
•
252
mehuldamani/bug_fixing_arl-7b-addmultiply-v4
Text Generation
•
8B
•
Updated
8 days ago
•
255
mehuldamani/bug_fixing_rlvr-7b-v4
Text Generation
•
8B
•
Updated
8 days ago
•
265
mehuldamani/sft-corrupted-qwen-v3
Text Generation
•
3B
•
Updated
20 days ago
•
1.42k
mehuldamani/sft-corrupted-qwen-v2
Text Generation
•
3B
•
Updated
20 days ago
•
403
mehuldamani/sft-corrupted-qwen-v1
Text Generation
•
3B
•
Updated
21 days ago
•
581
mehuldamani/rlvr-qwen-hmaze-v1
Text Generation
•
3B
•
Updated
22 days ago
•
332
mehuldamani/rlm-qwen-hmaze-v1-high-fifo
Text Generation
•
3B
•
Updated
22 days ago
•
330
mehuldamani/hmaze-oracle-v1-multiply
Text Generation
•
3B
•
Updated
22 days ago
•
310
mehuldamani/hmaze-oracle-v1
Text Generation
•
3B
•
Updated
22 days ago
•
313
mehuldamani/sft-qwen-hmaze-v2
Text Generation
•
3B
•
Updated
23 days ago
•
789
mehuldamani/sft-qwen-hmaze-v1
Text Generation
•
3B
•
Updated
23 days ago
•
442
mehuldamani/sft-qwen-zmaze-v3
Text Generation
•
3B
•
Updated
24 days ago
•
238
mehuldamani/sft-qwen-zmaze-v2
Text Generation
•
3B
•
Updated
25 days ago
•
714
mehuldamani/sft-qwen-zmaze-v1
Text Generation
•
3B
•
Updated
25 days ago
•
476
mehuldamani/sft-qwen-vmaze-v1
Text Generation
•
3B
•
Updated
26 days ago
•
1.1k
mehuldamani/rlvr_multi_k3
Updated
27 days ago
Previous
1
2
3
...
9
Next