| ▲ | kachapopopow 2 hours ago | |
I think people should stop comparing against Sonnet and compare against Opus instead, since it's so far ahead at producing code I would actually want to use (Gemini 3 Pro tends to be lacking in generalization and wants things to use its own style rather than adapting). Whatever benchmark Opus is ahead in should be treated as a very important metric of proper generalization in models. | ||