| ▲ | keerthiko 8 hours ago | ||||||||||||||||||||||
Claude Code is fairly notoriously token inefficient as far as coding agent/harnesses go (i come from aider pre-CC). It's only viable because the Max subscriptions give you approximately unlimited token budget, which resets in a few hours even if you hit the limit. But this also only works because cloud models have massive token windows (1M tokens on opus right now) which is a bit difficult to make happen locally with the VRAM needed. And if you somehow managed to open up a big enough VRAM playground, the open weights models are not quite as good at wrangling such large context windows (even opus is hardly capable) without basically getting confused about what they were doing before they finish parsing it. | |||||||||||||||||||||||
| ▲ | unsnap_biceps 8 hours ago | parent | next [-] | ||||||||||||||||||||||
I use CC at work, so I haven't explored other options. Is there a better one to use locally? I presumed they were all going to be pretty similar. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | aplomb1026 6 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
[dead] | |||||||||||||||||||||||
| ▲ | storus 8 hours ago | parent | prev [-] | ||||||||||||||||||||||
Can't you use Claude caveman mode? | |||||||||||||||||||||||