5 147 6

Donghao Zhou

donghao-zhou

https://correr-zhou.github.io

AI & ML interests

Generative AI

Recent Activity

upvoted a paper 1 day ago

GenClaw: Code-Driven Agentic Image Generation

upvoted a paper 1 day ago

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

upvoted a paper 4 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

View all activity

Organizations

upvoted 2 papers 1 day ago

GenClaw: Code-Driven Agentic Image Generation

Paper • 2605.30248 • Published 4 days ago • 31

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Paper • 2605.30263 • Published 4 days ago • 49

upvoted 2 papers 4 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 5 days ago • 68

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published 15 days ago • 31

upvoted a paper 5 days ago

SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills

Paper • 2605.24117 • Published 10 days ago • 19

upvoted a paper 7 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 12 days ago • 106

upvoted a paper 9 days ago

GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation

Paper • 2605.21605 • Published 12 days ago • 13

upvoted 3 papers 12 days ago

upvoted a paper 13 days ago

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

Paper • 2605.15824 • Published 17 days ago • 64

upvoted 2 papers 17 days ago

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published 18 days ago • 92

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 18 days ago • 84

upvoted 3 papers 18 days ago

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published 20 days ago • 29

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published 20 days ago • 67

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 20 days ago • 191

upvoted 2 papers 19 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published 21 days ago • 29

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published 21 days ago • 110

upvoted a paper 21 days ago

Anisotropic Modality Align

Paper • 2605.07825 • Published 24 days ago • 27

upvoted a paper 24 days ago

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Paper • 2605.03849 • Published 27 days ago • 125

Donghao Zhou

AI & ML interests

Recent Activity

Organizations

donghao-zhou's activity