-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 33 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 42 -
BitNet Distillation
Paper • 2510.13998 • Published • 60 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 53
Keylhan Paumard--André
keypa
AI & ML interests
Efficient deep learning, LLM fine-tuning, inference optimization, model compression, distributed training, GPU systems, open-source AI infrastructure
Recent Activity
liked a model 34 minutes ago
AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16 new activity about 7 hours ago
keypa/Qwen3.5-9B-Claude-4.7-GGUF:Claude-4.7 or Claude 3 Opus ? liked a model about 8 hours ago
sensenova/SenseNova-U1-8B-MoT