Datasets and model checkpoints of our Group Relative Reward Model (GRRM) framework
Sen Yang PRO
double7
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago
GRRM
published a model 1 day ago
double7/Qwen2.5-7B-MT-GRRM-Optimized-CLA
updated a dataset 2 days ago
double7/TowerBlocks-MT-CoT-ZhEn
Organizations
None yet