Nguyen Trung Hieu

JunHill

AI & ML interests

NLP, RL

Recent Activity

upvoted a collection 9 days ago

TrimKV

Collection • A set of models that can run with bounded memory • 13 items • Updated 9 days ago • 1

upvoted 3 papers 9 days ago

No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Paper • 2509.21880 • Published Sep 26, 2025 • 54

Make Each Token Count: Towards Improving Long-Context Performance with KV Cache Eviction

Paper • 2605.09649 • Published 11 days ago • 11

Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

Paper • 2512.03324 • Published Dec 3, 2025 • 2

upvoted a paper about 1 month ago

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

Paper • 2604.01658 • Published Apr 2 • 55

upvoted an article 6 months ago

Putting RL back in RLHF

Article • vwxyzjn, ArashAhmadian • Jun 12, 2024 • 111

upvoted a paper about 1 year ago

Prompt Cache: Modular Attention Reuse for Low-Latency Inference

Paper • 2311.04934 • Published Nov 7, 2023 • 32

upvoted 3 papers over 1 year ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

Taipan: Efficient and Expressive State Space Language Models with Selective Attention

Paper • 2410.18572 • Published Oct 24, 2024 • 18

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7, 2024 • 45

upvoted a paper almost 2 years ago

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 61

upvoted a paper over 2 years ago

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 79
