DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios Paper • 2604.25914 • Published Apr 28
Think and Answer ME: Benchmarking and Exploring Multi-Entity Reasoning Grounding in Remote Sensing Paper • 2603.12788 • Published Mar 20
Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models Paper • 2604.16593 • Published Apr 17
RhythmMamba: Fast, Lightweight, and Accurate Remote Physiological Measurement Paper • 2404.06483 • Published Feb 24, 2025