Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
59.7
TFLOPS
40
1
8
s
Tom-Neverwinter
Follow
rajafa's profile picture
LeroyDyer's profile picture
21world's profile picture
5 followers
·
22 following
Tom-Neverwinter
AI & ML interests
Making improvements to help the world.
Recent Activity
reacted
to
danielhanchen
's
post
with 🔥
13 days ago
We made a guide on how to run open LLMs in Claude Code, Codex and OpenClaw. Use Gemma 4 and Qwen3.6 GGUFs for local agentic coding on 24GB RAM Run with self-healing tool calls, code execution, web search via the Unsloth API endpoint and llama.cpp Guide: https://unsloth.ai/docs/basics/api
reacted
to
danielhanchen
's
post
with ❤️
13 days ago
We made a guide on how to run open LLMs in Claude Code, Codex and OpenClaw. Use Gemma 4 and Qwen3.6 GGUFs for local agentic coding on 24GB RAM Run with self-healing tool calls, code execution, web search via the Unsloth API endpoint and llama.cpp Guide: https://unsloth.ai/docs/basics/api
reacted
to
SeaWolf-AI
's
post
with 🔥
about 1 month ago
🧬 Darwin-27B-Opus: 86.9% on GPQA Diamond — World #5, Zero Training We are excited to share Darwin-27B-Opus, a 27B model that achieved 86.9% on GPQA Diamond — ranking #5 globally on the HuggingFace leaderboard — without a single gradient update. How? Darwin breeds pretrained models through evolutionary FFN crossbreeding. The father (Qwen3.5-27B) provides the reasoning architecture; the mother (Claude 4.6 Opus Reasoning Distilled) contributes structured chain-of-thought knowledge. CMA-ES automatically discovers optimal per-layer blending ratios — no human tuning required. The result surpasses the original Qwen3.5-27B (85.5%), GLM-5.1 (744B, 86.2%), and Qwen3.5-122B (86.6%). A 27B model outperforming 744B — with zero training, zero data, one GPU, ~2 hours. We also confirmed hybrid vigor on Korean benchmarks: Darwin-27B-KR (2nd generation offspring) surpassed both parents on CLIcK, winning 7 out of 11 categories. The evolutionary optimizer independently assigned 93% of FFN from the Korean-specialized mother while preserving 93% of attention from the reasoning-specialized father — autonomously validating our core principle: FFN carries knowledge, Attention carries reasoning. 📊 Public release: 10 days → 300+ community derivatives, 120K+ downloads. 🔗 Links: Darwin-27B-Opus: https://huggingface.co/FINAL-Bench/Darwin-27B-Opus article: https://huggingface.co/blog/FINAL-Bench/darwin-gpqa Darwin Family Collection: https://huggingface.co/collections/FINAL-Bench/darwin-family If foundation models are raw ore, Darwin is the forge. We are just getting started. 🔥
View all activity
Organizations
None yet
Tom-Neverwinter
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 Spaces
almost 2 years ago
Runtime error
14k
Open LLM Leaderboard
🏆
14k
Track, rank and evaluate open LLMs and chatbots
Running
192
Yet Another LLM Leaderboard
🌖
192
Launch a Streamlit web app interface
liked
a model
about 2 years ago
mlabonne/Beyonder-4x7B-v2
Text Generation
•
24B
•
Updated
Mar 4, 2024
•
234
•
128
liked
5 models
about 3 years ago
bigcode/starcoderbase
Text Generation
•
Updated
May 11, 2023
•
4.57k
•
418
bigcode/starcoder
Text Generation
•
16B
•
Updated
Oct 8, 2024
•
21.4k
•
2.95k
TheBloke/WizardLM-7B-uncensored-GGML
Updated
Jun 9, 2023
•
128
TheBloke/WizardLM-7B-uncensored-GPTQ
Text Generation
•
7B
•
Updated
Oct 26, 2023
•
1.37k
•
195
QuickWick/Music-AI-Voices
Updated
Jul 1, 2023
•
682