nomic-embed-text-v1.5 GGUF

GGUF format of nomic-ai/nomic-embed-text-v1.5 for use with CrispEmbed and Ollama-compatible runtimes.

Files

File Quantization Size Parity (cos vs HF)
nomic-embed-text-v1.5.gguf F32 ~522 MB 1.0000
nomic-embed-text-v1.5-q8_0.gguf Q8_0 ~139 MB 0.9980

NomicBERT uses SwiGLU which is sensitive to aggressive quantization. Q5_K (cos0.95) and Q4_K (cos0.85) are not provided as they degrade significantly.

Architecture

  • Model: NomicBERT (BERT + RoPE + SwiGLU, 137M params)
  • Embedding dimension: 768 (Matryoshka: 512, 256, 128, 64)
  • Pooling: Mean pooling + L2 normalize
  • Context length: 2,048 tokens
  • License: Apache 2.0

Notes

Ollama-compatible format (bert.* namespace). RoPE-based encoder with SwiGLU FFN.

Downloads last month
691
GGUF
Model size
0.1B params
Architecture
bert
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/nomic-embed-text-v1.5-GGUF

Quantized
(34)
this model