22 8

Aayush

Aayushfaced

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago

UnifoLM_WBT_Dataset

liked a model 3 months ago

openai-community/gpt2

upvoted an article 4 months ago

We Got Claude to Fine-Tune an Open Source LLM

View all activity

Organizations

None yet

upvoted a collection 7 days ago

UnifoLM_WBT_Dataset

Collection

8 items • Updated 8 days ago • 72

liked a model 3 months ago

openai-community/gpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 12.3M • 3.17k

upvoted 2 articles 4 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

618

Article

New in llama.cpp: Model Management

Dec 11, 2025

•

130

upvoted 2 collections 4 months ago

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 3 days ago • 102

Inference Optimized Checkpoints (with Model Optimizer)

Collection

A collection of generative models quantized and optimized for inference with Model Optimizer. • 61 items • Updated 1 day ago • 131

upvoted an article 4 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

•

liked a Space 5 months ago

The Smol Training Playbook

📚

3.08k

The secrets to building world-class LLMs

liked 3 Spaces 6 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.32k

Read a detailed overview of the FineWeb web‑scale text dataset

Robot Learning: A Tutorial

📝

373

Explore the Robot Learning tutorial online

The Ultra-Scale Playbook

🌌

3.76k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 papers 6 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 50

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

liked a dataset 7 months ago

InternRobotics/OmniWorld

Viewer • Updated 14 days ago • 6.35B • 31.5k • 89

upvoted 6 papers 7 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 182

Aayush

AI & ML interests

Recent Activity

Organizations

Aayushfaced's activity

We Got Claude to Fine-Tune an Open Source LLM

New in llama.cpp: Model Management

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

The Smol Training Playbook

FineWeb: decanting the web for the finest text data at scale

Robot Learning: A Tutorial

The Ultra-Scale Playbook