psadauskas 5 hours ago

I was using Claude until they banned Opencode, and now use GPT at my day job. I've been using Deepseek through Opencode Go on the $10/mo plan, and I honestly can't really tell much difference. It's just as capable, and makes the same kinds of dumb mistakes the other two have been making since March. For the price, I'm more than happy with it.

sankaritan 23 minutes ago | parent | next [-]

It's interesting. 95% of the time you don't need the extra 5% of rigor that frontier models provide compared to the 10-100x cheaper Chinese equivalents.

The remaining 5% of the time you get a big boost for your high-reasoning problem-solving needs and avoid a lot of pain. Now, I just need to be able to predict accurately when I need this extra 5% and when I don't :)

powerapple 2 minutes ago | parent [-]

In that extra 5% of the time, you will need to help the AI over multiple turns and give it the information it needs. The extra reasoning is rarely enough to finish the task on its own, i.e. in that 5% of cases the AI just can't complete the task without a lot of help.

selfawareMammal an hour ago | parent | prev | next [-]

I have both subscriptions and I definitely feel GPT is better and more consistent, but when I run out of limits I don't miss it too much.

miroljub 16 minutes ago | parent [-]

That's the whole point. The tool you have vs. the expensive tools you don't have because they're too expensive.

I don't feel like paying 100 times the price for a 1-5% better tool.

joystick_0x0 2 hours ago | parent | prev [-]

I am not sure what I am doing wrong then. I have been using Claude for the last 7 months and from time to time I try other models like Deepseek, Kimi, etc. Nothing comes even close to it. Claude one-shots it almost every time (99.99%).

InsideOutSanta 24 minutes ago | parent | next [-]

In my experience, there is a very specific use case of one-shotting complex, long tasks with relatively vague or incomplete descriptions where Opus does substantially better than all other models I've tried, including GPT 5.5, GLM 5.1 and DS4. It seems to be better at inferring unstated requirements and creating a complete, working, reasonably well-designed solution.

However, that's probably not how most professional developers use LLMs. I tend to give well-specified, more constrained tasks, and for those, I find that Opus performs worse than other models precisely because it tends to infer unstated requirements and do things I didn't want it to do. In this situation, GPT 5.5 works better for me because it only and precisely does what I ask it to.

skerit 28 minutes ago | parent | prev | next [-]

Same here. Claude isn't perfect. It still makes a lot of mistakes. But whenever I try GPT-5.5 it's ten times worse, and Claude just has to clean up GPT's mess.

OtomotO an hour ago | parent | prev [-]

You're obviously not doing anything wrong if it works for you.

It worked for me too, for months, when I was working on trivial web projects.

Around February of this year it got lobotomized, and I cancelled my subscription at the end of March.

I am not going back.