YuvrajSingh9886/LFM2.5-350M-grpo-summarization-quality-bleu Summarization • 0.4B • Updated 6 days ago • 258 • 2
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28, 2025 • 72 • 18
ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6 Reinforcement Learning • 15B • Updated Jul 1, 2025 • 13.8k • 44