How can I use this with llama.cpp?

#24
by KeilahElla - opened

How can I use this with llama.cpp?

You can run
llama-server -hf tecaprovn/deepseek-v4-flash-gguf:Q4_K_M

here is the repo link:
https://huggingface.co/tecaprovn/deepseek-v4-flash-gguf

AFAIK, there's no official support for Deepseek-v4, MiMo v2.5, or Ling 2.6 with llama.cpp.

It's a little strange because normally updates are so frequent and fast, but these models remain unsupported despite so many uploads for them.

This comment has been hidden (marked as Off-Topic)
This comment has been hidden (marked as Off-Topic)

Sign up or log in to comment