thomasjb 3 hours ago
Unfortunately there are no GGUF quants of the assistant model yet: https://huggingface.co/models?other=base_model:quantized:goo...
kristjansson 3 hours ago
I think MTP support for Gemma4 is still WIP: https://github.com/ggml-org/llama.cpp/pull/23398