LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling • Paper • 2604.11748 • Published 2 days ago • 10
HandX: Scaling Bimanual Motion and Interaction Generation • Paper • 2603.28766 • Published 17 days ago • 12
Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration • Paper • 2603.12226 • Published Mar 12 • 4
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data • Paper • 2602.21320 • Published Feb 24 • 12
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs • Paper • 2602.07276 • Published Feb 7 • 11
CodeCircuit: Toward Inferring LLM-Generated Code Correctness via Attribution Graphs • Paper • 2602.07080 • Published Feb 6 • 6
Finding Inductive Loop Invariants using Large Language Models • Paper • 2311.07948 • Published Nov 14, 2023 • 1
TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons • Paper • 2504.19982 • Published Apr 28, 2025
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents • Paper • 2505.01592 • Published May 2, 2025
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models • Paper • 2311.07022 • Published Nov 13, 2023 • 2
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare • Paper • 2404.16621 • Published Apr 25, 2024
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents • Paper • 2411.00927 • Published Nov 1, 2024 • 2
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model • Paper • 2502.08820 • Published Feb 12, 2025 • 5