Mesh LLM

DeepSeek-V3.2-UD-Q4_K_XL

Distributed GGUF inference package for Mesh LLM

Website GitHub Discord

GGUF layer package for running DeepSeek-V3.2-UD-Q4_K_XL across a local Mesh LLM cluster.

This package is derived from unsloth/DeepSeek-V3.2-GGUF and keeps the original GGUF distribution split into per-layer artifacts for distributed inference.

Highlights

Run locally Pool multiple machines OpenAI-compatible Package variant
Private inference on your hardware Split layers across peers Serve /v1/chat/completions locally UD-Q4_K_XL layer package

Model Overview

Property Value
Source model unsloth/DeepSeek-V3.2-GGUF
Model id unsloth/DeepSeek-V3.2-GGUF:UD-Q4_K_XL
Family DeepSeek
Parameter scale not recorded
Quantization UD-Q4_K_XL
Layer count 61
Activation width 7168
Package size 380.1 GB
Source file UD-Q4_K_XL/DeepSeek-V3.2-UD-Q4_K_XL-00001-of-00009.gguf
Package repo meshllm/DeepSeek-V3.2-UD-Q4_K_XL-layers

Recommended Use

  • Local and private inference with Mesh LLM.
  • Multi-machine serving when the full GGUF is too large for one host.
  • OpenAI-compatible chat/completions workflows through Mesh LLM's local API.

For upstream architecture details, chat template guidance, sampling recommendations, license terms, and benchmark notes, see the source model card: unsloth/DeepSeek-V3.2-GGUF.

Quickstart

# Run this on each machine that should contribute memory/compute.
mesh-llm serve --model "meshllm/DeepSeek-V3.2-UD-Q4_K_XL-layers" --split
# Check the mesh and discover the OpenAI-compatible model name.
curl -s http://localhost:3131/api/status
curl -s http://localhost:3131/v1/models
# Send an OpenAI-compatible chat request.
curl -s http://localhost:3131/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "unsloth/DeepSeek-V3.2-GGUF:UD-Q4_K_XL",
    "messages": [{"role": "user", "content": "Write a tiny hello-world function in Rust."}],
    "max_tokens": 128
  }'

Package Variant

Property Value
Format layer-package
Canonical source ref unsloth/DeepSeek-V3.2-GGUF@main/UD-Q4_K_XL/DeepSeek-V3.2-UD-Q4_K_XL-00001-of-00009.gguf
Source revision main
Source SHA-256 44c643a4d2531b66645d46c897aef48c757191411f0dcbe64a69a057d57bb857
Skippy ABI 0.1.14
Package manifest SHA-256 6def4ab3842d547905e658bb478a633c2c5f6cfa9fc82fcc6f02b6129ed85330

What Is Included

Artifact Path Contents SHA-256
Manifest model-package.json Package schema, source identity, checksums 6def4ab3842d547905e658bb478a633c2c5f6cfa9fc82fcc6f02b6129ed85330
Metadata shared/metadata.gguf 0 tensors, 5.0 MB a7464c3ca495bea827bac05331c70e8bfecf940cb7d38bf195352d7a8a8d7eaf
Embeddings shared/embeddings.gguf 1 tensors, 502.1 MB 6f24f2a3dfcc07e0c5a72293f9583638a4d2896f967c7833bd1cafdfa58f4c78
Output head shared/output.gguf 2 tensors, 730.0 MB 604a649254113fbed225a2b6691a3e7031fba51e8025bb3509296055f59d573c
Transformer layers layers/layer-*.gguf 61 layer artifacts, 1083 tensors, 378.9 GB see model-package.json

Validation

Generated by the Mesh LLM HF Jobs splitter from mesh-llm ref main and validated before upload:

skippy-model-package validate-package "/source/UD-Q4_K_XL/DeepSeek-V3.2-UD-Q4_K_XL-00001-of-00009.gguf" "$PACKAGE_DIR"

Links

Downloads last month
4,261
GGUF
Model size
0.6B params
Architecture
deepseek2
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for meshllm/DeepSeek-V3.2-UD-Q4_K_XL-layers

Quantized
(1)
this model