Remix.run Logo
kachapopopow 2 hours ago

I think people should stop comparing to sonnet, but to opus instead since it's so far ahead on producing code I would actually want to use (gemini 3 pro tends to be lacking in generalization and wants things to be using it's own style rather than adapting).

Whatever benchmark opus is ahead in should be treated as a very important metric of proper generalization in models.