Remix.run Logo
XCSme 5 hours ago

Gpt 5.5 is quite a big leap, it's a lot better than opus 4.7 for agentic coding

energy123 5 hours ago | parent | next [-]

Arena only allows very small context sizes, so it's a noisy benchmark for what we care about IRL.

mettamage 3 hours ago | parent | prev [-]

Better in what ways? I'm just curious about your experience.

XCSme 3 hours ago | parent [-]

Consistency, not making mistakes.

mettamage 3 hours ago | parent [-]

Ahh... that is indeed an issue I have with Claude. I'll check it out!