Remix.run Logo
user- 8 hours ago

I am a believer that everyone should have their main flow be model/provider agnostic at a high level. I often run out of claude tokens and use GLM-5 as backup.

https://gist.github.com/ManveerBhullar/7ed5c01a0850d59188632...

simple script i use to toggle which backend my claude code is using

bob1029 8 hours ago | parent | next [-]

I tried the agnostic thing for a while, but there are enough quirks between the providers that I gave up trying to normalize it. GPT5.x wipes the floor with other models for my specific tool calling scenarios. I am not going to waste time trying to bridge arbitrary and evolving gaps between providers.

I put my Amex details into OAI, I get tokens, it just works. I really don't understand what the hell is going on with Claude. The $200/m thing is so confusing to me. I'd rather just go buy however many tokens I plan to use. $200 worth of OAI tokens would go a really long way for me (much longer than a month), but perhaps I am holding it wrong.

fastball 8 hours ago | parent | prev | next [-]

Being model and provider agnostic are orthogonal concerns.

e.g. you can run Claude models on AWS Bedrock giving you provider choice for the same model. Whether or not you need model agnosticism at that point seems like a very different question.

user- 8 hours ago | parent [-]

> . you can run Claude models on AWS Bedrock giving you provider choice for the same model

Is anyone doing this for personal dev that isnt token fed by employers? Coding plans are subzidized for a reason right? If I did API usage from a cloud provider id be out tens of thousands already.

cyanydeez 8 hours ago | parent | prev | next [-]

Interesting; do you find they actually react the same way to the harness?

user- 8 hours ago | parent [-]

There are differences for sure. Claude models feel the most 'stable' in that I see less tool confusion messages and other mistakes like the one im looking at right now.

"Wait, I'm editing the wrong sections. The edit tool tried to match but replaced with different prop names than what was in the file. Let me re-read the file and understand the current state properly."

And of course models are not 1-to-1 and have different strengths and weaknesses. I know I wont get the same quality plan mode output probably. Its a tradeoff.

cyanydeez 6 hours ago | parent [-]

I generally assume the differences could be minimized by tailoring the instructions to the models; that they're not incapable of doing the same things, but the way in which they're instructed matters because it needs to draw on training.

But I don't use any of the cloud stuff; I'm local4lyfe.

boxingdog 8 hours ago | parent | prev [-]

[dead]