UniSurg: A Video-Native Foundation Model for Universal Understanding of Surgical Videos Paper • 2602.05638 • Published Feb 5 • 9
PhysVideoGenerator: Towards Physically Aware Video Generation via Latent Physics Guidance Paper • 2601.03665 • Published Jan 7 • 1
Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers Paper • 2509.24317 • Published Sep 29, 2025 • 12
JEPA-VLA: Video Predictive Embedding is Needed for VLA Models Paper • 2602.11832 • Published Feb 12 • 1
VLA-JEPA Collection VLA-JEPA model checkpoints (LIBERO, Pretrain, SimplerEnv) • 3 items • Updated May 28 • 14
Gemma Diffusion Website Builder 🌐 Space Watch a diffusion LLM write a website live, then tweak it • Running on Zero • Agents • Featured • 67
World Tracing Demo 🌍 Space Multilayer-geometry 3D from a single image or 16-frame clip • Running on Zero • Agents • 17
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published May 28 • 41