mark_l_watson an hour ago
You are using Q6 (6-bit) quantization; on my 32 GB Mac mini I use Q4, which is faster. Even so, when I use it with OpenCode, I set up a task and go outside to walk for ten minutes. Smart, capable, and slow. Still, I love using local models. EDIT: I run with context wired at 64K.