Submitted by akhaliq 43 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory · 5 authors 47.9k 2
Submitted by ambean 26 Clinical knowledge in LLMs does not translate to human interactions · 11 authors 11 5
Submitted by lgy0404 23 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects · 18 authors 158 4
Submitted by judge 18 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning · 8 authors 25 2
Submitted by QizhiPei 18 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges · 9 authors 11 4
Submitted by cloudcatcher2 13 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency · 7 authors 10 3
Submitted by iofu728 8 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention · 11 authors 1.19k 2
Submitted by soujanyaporia 7 NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks · 8 authors 207 2
Submitted by AaronZ345 6 Versatile Framework for Song Generation with Prompt-based Control · 11 authors 223 2
Submitted by renqiux0302 6 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving · 13 authors 23 2
Submitted by FocusV857 5 ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers · 5 authors 0 2
Submitted by observerw 4 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development · 6 authors 23 2