Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published 24 days ago • 30
LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published Dec 10, 2025 • 87
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads Paper • 2602.09443 • Published 26 days ago • 57
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 27 days ago • 42
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 27 days ago • 68
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published Feb 2 • 32
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published Nov 28, 2025 • 24
QUBE: Enhancing Automatic Heuristic Design via Quality-Uncertainty Balanced Evolution Paper • 2412.20694 • Published Dec 30, 2024
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts Paper • 2508.07785 • Published Aug 11, 2025 • 29
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs Paper • 2508.05257 • Published Aug 7, 2025 • 13
Value Residual Learning For Alleviating Attention Concentration In Transformers Paper • 2410.17897 • Published Oct 23, 2024 • 9 • 2