xthunk 5 hours ago
Really interesting note. It echoes thoughts I’ve had about how much automated benchmark scores really reflect production-ready code. For me the big takeaway is that a passing result doesn't automatically mean the code is maintainable, follows established patterns and conventions, or avoids the unexpected side effects that real reviewers care about.