Remix.run Logo
lxgr 10 days ago

Am I missing something or are the Ollama versions of this (https://ollama.com/library/gemma4/tags) text-only for now?

philipkglass 10 days ago | parent | next [-]

Since ollama has diverged from llama.cpp, it will take a bit of time for ollama to support multi-modality. If you're using plain llama.cpp it looks like a PR has already merged for this model with vision and audio support:

https://github.com/ggml-org/llama.cpp/pull/24077

zozbot234 10 days ago | parent [-]

They've actually gone back to (a lightly patched) llama.cpp with the 0.30 release a few weeks ago, and have now vendored-in an up to date release. Needless to say this is great news for both projects!

satvikpendem 10 days ago | parent | prev | next [-]

Just use llama.cpp or Unsloth Studio which wraps it, I don't know why anyone use Ollama anymore.

verdverm 10 days ago | parent [-]

I switched from llama.cpp to vLLM because of prompt cache bugs in qwen/gemma models

This is a good starting issue with a bunch of linked/related

https://github.com/ggml-org/llama.cpp/issues/22746

dwroberts 9 days ago | parent | prev | next [-]

Highly recommend just dropping Ollama. You can download binary releases of llama.cpp for every platform and run them trivially in 5 seconds. Ollama serves no purpose other than to take open source work and rebadge as its own, while providing inferior functionality

lxgr 9 days ago | parent | prev | next [-]

To anybody else wondering: Seems like the models supporting image input are just starting to show up. https://ollama.com/library/gemma4:12b-mlx now shows as supporting it, but curiously the overview on https://ollama.com/library/gemma4/tags still lists it as text only. Cache invalidation remains difficult :)

kordlessagain 9 days ago | parent [-]

Yup, the new version of Ollama dropped. Time to update.

Jabrov 10 days ago | parent | prev | next [-]

Stop using ollama

thot_experiment 10 days ago | parent | prev [-]

Ollama is a shitty project that steals from the open source community, don't use it, use llama.cpp instead.