Remix.run Logo
groby_b 5 days ago

"that every model is either about to be great or was great in the past but now is not"

FWIW, Codex-CLI w/ ChatGPT5 medium is great right now. Objectively accelerating me. Not a coding god like some posters would have it, but overall freeing up time for me. Observably.

Assuming I haven't had since-cured delusions, the same was true for Claude Code, but isn't any more.

Concrete supporting evidence: From time to time, I have coding CLIs port older projects of varying (but small-ish) sizes from JS to TS. Claude Code used to do well on that. Repeatedly. I did another test last Sunday, and it dug a momentous hole for itself that even liberal sprinkling of 'as unknown' everywhere couldn't solve. Codex managed both the ab-initio port and was able to undig from CC's massive hole abandoned mid-port.

So I'd say the evidence points somewhat against random process, given repeated testing shows clear signal both of past capability and of recent loss of capability.

The idea that it's a "random" process is misguided.