firefax 3 days ago
I've been using Ollama; Gemma3:12b is about all my little Air can handle. If anyone has suggestions for other models, I'd welcome them. As an experiment, I asked it to design me a new LaTeX résumé, and it struggled for two hours with the request to put my name prominently at the top in a grey box, with my email and phone number beside it.
james2doyle 3 days ago
I've been playing with the new IBM Granite models. They are small and quick, and they seem accurate. You can even try them online in the browser, because they are small enough to be loaded via the filesystem: https://huggingface.co/spaces/ibm-granite/Granite-4.0-Nano-W... Not only are they a lot more recent than Gemma, they also seem really good at tool calling, so they are probably a good fit for coding tools, though I haven't tried them for that myself. The model page is here: https://huggingface.co/ibm-granite/granite-4.0-h-1b