VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning Paper • 2504.08837 • Published Apr 10, 2025 • 44
Running on Zero Agents 8 RationalRewards 📉 8 Empowering Visual Generation with Rationalized Rewards