Remix.run Logo
unsnap_biceps 9 hours ago

lm studio offers an Anthropic compatible local endpoint, so you can point Claude code at it and it'll use your local model for it's requests, however, I've had a lot of problems with LM Studio and Claude code losing it's place. It'll think for awhile, come up with a plan, start to do it and then just halt in the middle. I'll ask it to continue and it'll do a small change and get stuck again.

Using ollama's api doesn't have the same issue, so I've stuck to using ollama for local development work.

keerthiko 8 hours ago | parent | next [-]

Claude Code is fairly notoriously token inefficient as far as coding agent/harnesses go (i come from aider pre-CC). It's only viable because the Max subscriptions give you approximately unlimited token budget, which resets in a few hours even if you hit the limit. But this also only works because cloud models have massive token windows (1M tokens on opus right now) which is a bit difficult to make happen locally with the VRAM needed.

And if you somehow managed to open up a big enough VRAM playground, the open weights models are not quite as good at wrangling such large context windows (even opus is hardly capable) without basically getting confused about what they were doing before they finish parsing it.

unsnap_biceps 8 hours ago | parent | next [-]

I use CC at work, so I haven't explored other options. Is there a better one to use locally? I presumed they were all going to be pretty similar.

jaggederest 7 hours ago | parent | next [-]

If you want to experiment with same-harness-different-models Opencode is classically the one to use. After their recent kerfluffle with Anthropic you'll have to use API pricing for opus/sonnet/haiku which makes it kind of a non-starter, but it lets you swap out any number of cloud or local models using e.g. ollama or z.ai or whatever backend provider you like.

I'd rate their coding agent harness as slightly to significantly less capable than claude code, but it also plays better with alternate models.

blitzar 6 hours ago | parent [-]

I am hopeful the leaked claude code narrows the capability, perhaps even googles offering will be viable once they borrow some ideas from claude.

satvikpendem 2 hours ago | parent | prev [-]

OpenCode

aplomb1026 6 hours ago | parent | prev | next [-]

[dead]

storus 8 hours ago | parent | prev [-]

Can't you use Claude caveman mode?

https://github.com/JuliusBrussee/caveman

mbesto 6 hours ago | parent | prev [-]

I don't get why I would use Claude Code when OpenCode, Cursor, Zed, etc. all exist, are "free" and work with virtually any llm. Seems like a weird use case unless I'm missing something.

panagathon an hour ago | parent | next [-]

> I don't get why I would use Claude Code when OpenCode, Cursor, Zed, etc. all exist, are "free" and work with virtually any llm. Seems like a weird use case unless I'm missing something.

I'm with you on this. I've tried Gemma and Claude code and it's not good. Forgets it can use bash!

However, Gemma running locally with Pi as the harness is a beast.

superb_dev 5 hours ago | parent | prev | next [-]

From my experience, Claude Code is just better. Although I recently started using Zed and it’s pretty good

blitzar 6 hours ago | parent | prev | next [-]

previously I have found claude code to be just better than the alternatives, using large models or local. It is, however, closer now and not much excuse for the competition after the claude code leak. Personally, I will be giving this a go with OpenCode.

bdangubic 6 hours ago | parent | prev [-]

this is like asking why use intellij or vscode or … when there is vim and emacs

NamlchakKhandro 4 hours ago | parent [-]

No it's more like, why use a Microsoft paid for distro of nvim when lazyvim, astronvim exist