mssfj/Qwen2.5-7B-Instruct_dbbench_grpo_dataset_react-2 • Text Generation • 8B • Updated 4 days ago • 26
mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset-2 • Text Generation • 8B • Updated 4 days ago • 26
mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset • Text Generation • 8B • Updated 4 days ago • 75
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-15 • Text Generation • 8B • Updated 7 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-14 • Text Generation • 8B • Updated 7 days ago
mssfj/Qwen2.5-7B-Instruct_sft_alfworld_trajectory_dataset_v5-13 • Text Generation • 8B • Updated 7 days ago