SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published 5 days ago • 62
You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass Paper • 2604.10966 • Published 7 days ago • 11
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 9 days ago • 74
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 11 days ago • 279
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images Paper • 2604.09531 • Published 10 days ago • 8
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 11 days ago • 238
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence Paper • 2604.07296 • Published 12 days ago • 39
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 12 days ago • 317
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 12 days ago • 184
Action Images: End-to-End Policy Learning via Multiview Video Generation Paper • 2604.06168 • Published 13 days ago • 14
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 14 days ago • 40
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 14 days ago • 44
StyleRenderer 🎨: Generate stylized video from game G-buffer inputs Space • Running on Zero • Agents • Featured • 11
Token Warping Helps MLLMs Look from Nearby Viewpoints Paper • 2604.02870 • Published 17 days ago • 34