Patrick_Devine 10 days ago
I realize this is a little confusing; we're working with the MLX team to bring MLX to other platforms, but we're not quite there yet. The `gemma4:12b-nvfp4` model is specifically for the MLX engine. For the GGUF 4-bit variant (i.e. non-Macs) you'll need `gemma4:12b-it-q4_K_M`, which I just pushed. You'll also need to upgrade to version 0.30.4, which we're just about to release (it's in prerelease and we're running through our last regression tests).
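For anyone scripting around this, here is a minimal sketch of the tag choice described in the comment above. The two tags and the 0.30.4 minimum are taken from the comment; the helper names `pick_tag` and `version_ok`, and the rule that "Mac" means Apple Silicon macOS (`Darwin` / `arm64`), are assumptions made for illustration and not anything the maintainers have confirmed.

```python
import platform

# Tags and minimum version as quoted in the comment above.
MLX_TAG = "gemma4:12b-nvfp4"        # MLX engine only
GGUF_TAG = "gemma4:12b-it-q4_K_M"   # GGUF 4-bit variant for non-Macs
MIN_VERSION = (0, 30, 4)


def pick_tag(system: str, machine: str) -> str:
    """Return the MLX tag on Apple Silicon macOS, otherwise the GGUF tag.

    Assumption: MLX is only usable on Darwin/arm64 for now.
    """
    if system == "Darwin" and machine == "arm64":
        return MLX_TAG
    return GGUF_TAG


def version_ok(version: str) -> bool:
    """Check a plain 'X.Y.Z' version string against MIN_VERSION.

    Prerelease suffixes (e.g. '-rc1') are not handled in this sketch.
    """
    parts = tuple(int(p) for p in version.split(".")[:3])
    return parts >= MIN_VERSION


if __name__ == "__main__":
    # Print the tag that this sketch would choose for the current machine.
    print(pick_tag(platform.system(), platform.machine()))
```

Passing the platform values in as arguments keeps the rule easy to check for a machine other than the one running the script.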
embedding-shape 9 days ago
I gotta say, having both "gemma4:12b-mlx-bf16" and "gemma4:12b-nvfp4" be MLX-specific, and not labeling all of the MLX-specific ones as such, is less "a little confusing" and more "set up to be confusing" :)

> You'll also need to upgrade to version 0.30.4 which we're just about to release

Interesting, wasn't Google coordinating today's release with you? The blog post seems to have gone out well before the release had even been cut.
spicySpy 9 days ago
Would you mind sharing the link to `gemma4:12b-it-q4_K_M`?