1 90 426

Xi Yang

ianyeung

IanYeung

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 hour ago

NEO-unify: Building Native Multimodal Unified Models End to End

liked a model 1 day ago

XiaomiMiMo/MiMo-V2.5

liked a model 1 day ago

XiaomiMiMo/MiMo-V2.5-Pro

View all activity

Organizations

None yet

upvoted an article about 1 hour ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

Mar 5

•

148

upvoted a paper 4 days ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 5 days ago • 90

upvoted a paper 5 days ago

Co-Director: Agentic Generative Video Storytelling

Paper • 2604.24842 • Published 7 days ago • 16

upvoted a paper 12 days ago

Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items

Paper • 2604.19748 • Published 13 days ago • 249

upvoted a paper 13 days ago

EasyVideoR1: Easier RL for Video Understanding

Paper • 2604.16893 • Published 16 days ago • 40

upvoted an article 14 days ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

110

upvoted a paper 17 days ago

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published 19 days ago • 117

upvoted a paper 18 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 19 days ago • 155

upvoted 2 papers 21 days ago

RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details

Paper • 2604.06870 • Published 26 days ago • 41

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 26 days ago • 187

upvoted 2 papers 27 days ago

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published 28 days ago • 32

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 28 days ago • 203

upvoted 8 papers about 1 month ago

Xi Yang

AI & ML interests

Recent Activity

Organizations

ianyeung's activity

NEO-unify: Building Native Multimodal Unified Models End to End

Vision Language Model Alignment in TRL ⚡️