GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents Paper • 2606.24551 • Published 8 days ago • 28
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting Paper • 2606.18394 • Published 5 days ago • 32
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 7 days ago • 139
Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents Paper • 2606.19704 • Published 12 days ago • 41
Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance Paper • 2606.19195 • Published 13 days ago • 138
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 14 days ago • 207
view article Article Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action nvidia • 29 days ago • 84
Running on Zero Agents Featured 59 Cosmos3-Nano 🌌 59 NVIDIA Cosmos3-Nano — text/image to video + audio
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 20 days ago • 121
AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization Paper • 2606.07326 • Published 25 days ago • 29