Running 16 Defeating the trainer-generator precision mismatch in TRL 🎯 16 Download research PDF (Pro access required)
Running Featured 75 Distilling 100B+ Models 40x Faster with TRL 📝 75 TRL distillation for 100B+ teachers, 40x faster
google/timesfm-2.5-200m-transformers Time Series Forecasting • 0.2B • Updated 22 days ago • 237k • 75