Duke University

university

Verified

https://duke.edu

AI & ML interests

None defined yet.

Recent Activity

MartinKu authored a paper 22 days ago

T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

lordmarkhor updated a Space about 1 month ago

DukeUniversity/README

uthpalaherath published a Space 3 months ago

DukeUniversity/README

View all activity

authored a paper 22 days ago

T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published 23 days ago • 121

authored 2 papers 22 days ago

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

Paper • 2603.02482 • Published 24 days ago • 3

T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published 23 days ago • 121

submitted a paper to Daily Papers 22 days ago

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

Paper • 2603.02482 • Published 24 days ago • 3

authored a paper about 1 month ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 56

updated a Space about 1 month ago

README

published a Space 3 months ago

README

authored 3 papers 5 months ago

Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution

Paper • 2408.00160 • Published Jul 31, 2024 • 1

BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning

Paper • 2505.23883 • Published May 29, 2025 • 2

BIOCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

Paper • 2510.20095 • Published Oct 23, 2025 • 1

authored a paper 5 months ago

AsyncVoice Agent: Real-Time Explanation for LLM Planning and Reasoning

Paper • 2510.16156 • Published Oct 17, 2025 • 2

authored 4 papers 5 months ago

DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance

Paper • 2505.14708 • Published May 17, 2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42

The Geometry of Reasoning: Flowing Logics in Representation Space

Paper • 2510.09782 • Published Oct 10, 2025 • 7

Why Do Transformers Fail to Forecast Time Series In-Context?

Paper • 2510.09776 • Published Oct 10, 2025 • 3

authored 2 papers 5 months ago

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Paper • 2503.05731 • Published Feb 19, 2025 • 3

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published Oct 14, 2025 • 50

authored 2 papers 6 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 141

Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap

Paper • 2509.26542 • Published Sep 30, 2025 • 9

authored a paper 10 months ago

CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models

Paper • 2505.19235 • Published May 25, 2025 • 4