drnick1 4 hours ago
Works beautifully on a 3090, very usable speed. Don't expect Opus 4.8-level performance, but there are some things you just need to keep local.
ljosifov 4 hours ago | parent
True - they are workhorses. Not super bright, but good enough for lots of everyday tasks. I've found the sweet spot to be turning thinking off, as it adds little or no value while increasing the token count and waiting time. The last 27B I used was https://huggingface.co/Jackrong/Qwopus3.6-27B-Coder-GGUF - specifically adapted a bit in post-training to run with thinking off. I saw today that the 35B-A3B MoE from the same HF account is out, downloading that rn to try.