Remix.run Logo
ifwinterco 2 days ago

For coding Opus 4.5 in q3 2025 was still the best model I've used.

Since then it's just been a cycle of the old model being progressively lobotomised and a "new" one coming out that if you're lucky might be as good as the OG Opus 4.5 for a couple of weeks.

Subjective but as far as I can tell no progress in almost a year, which is a lifetime in 2022-25 LLM timelines

_air a day ago | parent | next [-]

Opus 4.5 was released on Nov 24 last year. It’s only been 5 months!

ifwinterco a day ago | parent [-]

Wow you're right, okay not so bad then.

That brief two week period when Opus could eat entire tickets was simultaneously fantastic and a bit alarming

dannyw 2 days ago | parent | prev [-]

Another annoyance (for more API use) is summarized/hidden reasoning traces. It makes prompt debugging and optimization much harder, since you literally don't have much visibility into the real thinking process.