Barış Deniz Sağlam

bdsaglam

AI & ML interests

language models, reinforcement learning

Organizations

None yet

Recent Activity

liked a Space 1 day ago

AdithyaSK/rl-environments-guide

The ultimate guide to RL environments: building and scaling them in the LLM era

109

Building and scaling RL environments for LLM training

upvoted an article 2 months ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

updated a model 3 months ago

bdsaglam/Nemotron-Cascade-14B-Thinking

15B • Updated Feb 13 • 4 • 1

published a model 3 months ago

bdsaglam/Nemotron-Cascade-14B-Thinking

15B • Updated Feb 13 • 4 • 1

updated a dataset 8 months ago

bdsaglam/hover

Viewer • Updated Aug 31, 2025 • 26.2k • 117

published a dataset 8 months ago

bdsaglam/hover

Viewer • Updated Aug 31, 2025 • 26.2k • 117

published a model 9 months ago

bdsaglam/Qwen2.5-14B-Instruct-ragent-grpo-20250807_073051

Updated Aug 7, 2025

updated 2 models 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250603_205328

Updated Jun 6, 2025

bdsaglam/Llama-3.1-8B-Instruct-ragent-20250603_205328-merged

8B • Updated Jun 5, 2025 • 1

published 2 models 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-20250603_205328-merged

8B • Updated Jun 5, 2025 • 1

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250603_205328

Updated Jun 6, 2025

updated a model 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250602_094840

Updated Jun 3, 2025

published 2 models 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250602_094840

Updated Jun 3, 2025

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250601_195603

Updated Jun 1, 2025

updated a model 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250531_141657

8B • Updated Jun 1, 2025 • 1

published a model 11 months ago

bdsaglam/Llama-3.1-8B-Instruct-ragent-grpo-20250531_141657

8B • Updated Jun 1, 2025 • 1

updated a model 11 months ago

bdsaglam/Qwen2.5-14B-Instruct-ragent-grpo-20250530_155020

Updated May 31, 2025

published a model 11 months ago

bdsaglam/Qwen2.5-14B-Instruct-ragent-grpo-20250530_155020

Updated May 31, 2025

updated a dataset 11 months ago

bdsaglam/triviaqa-wiki-musique-mini

Viewer • Updated May 30, 2025 • 600 • 34

updated a model 12 months ago

bdsaglam/Qwen2.5-14B-Instruct-ragent-grpo-20250529_190051

Updated May 30, 2025
