jdross 3 days ago

The pace of notable releases across the industry right now is unlike any time I remember since I started doing this in the early 2000s. And it feels like it's accelerating.

achierius 3 days ago

How is this a notable release? It's strictly worse than Gemini 2.5 on coding &c, and only an iterative improvement over their own models. The only thing that struck me as particularly interesting was the native visual reasoning.

og_kalu 3 days ago

It's not worse on coding. SWE-bench, Aider, and LiveBench coding all show noticeably better results.

qoez 3 days ago

Lots of releases, but very little actual performance increase.

emp17344 3 days ago

Not really. We’re definitely in the incremental improvement stage at this point. Certainly no indication that progress is “accelerating”.