| ▲ | kofu 7 hours ago | |||||||
My experience also aligns with this. I'm running gemma4 31B on a 4090 through llm.cpp with unsloth models. I also run Qwen 3.6. Qwen is good for thinking and planning as it is faster, but Gemma4's generated code is much higher quality in the first try (Rust, C++ and C#). so it needs less revisions to be at a level I'm comfortable for merging. | ||||||||
| ▲ | beastman82 6 hours ago | parent | next [-] | |||||||
I second unsloth models. I'm using them over blackwell-oriented nvfp4 models as they are (empirically) top quality and performance. | ||||||||
| ||||||||
| ▲ | 6 hours ago | parent | prev [-] | |||||||
| [deleted] | ||||||||