Binfeng Xu's picture

Binfeng Xu

billxbf

AI & ML interests

evolving back to apes

Recent Activity

updated a model 11 days ago

billxbf/qwen3.5-4b-codex-polar-step72

published a model 11 days ago

billxbf/qwen3.5-4b-codex-polar-step72

upvoted a paper about 2 months ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

View all activity

Organizations

upvoted a paper about 2 months ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published Mar 19 • 14

upvoted 2 papers 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

upvoted an article over 1 year ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

+2

smangrul, sgugger, lewtun, philschmid

•

Sep 13, 2023

• 32