| ▲ | nsingh2 2 hours ago | ||||||||||||||||
Why supply underspecified requirements in the first place? Both models are good at challenging assumptions/edge cases and asking questions to clarify, but seemingly only when explicitly asked (i.e. something like a "brainstorm" skill). I don't think either harnesses do enough to encourage the model to challenge all assumptions and ask questions, maybe because users might find it annoying. That step is basically a requirement IMO. I've found all of the GPT-5 models to be very nit-picky, useful for code review and mathematics (important for my work), but seemingly gets in the way of "aesthetic" code, e.g. overly defensive code to cover all edge cases, even if unlikely. There is seemingly also a tradeoff between flexibility vs instruction following. In my experience Opus will sometimes ignore instructions but can "fill in the blanks" more, vs GPT-5.5 follows instructions better but perhaps at the cost of rigidity. | |||||||||||||||||
| ▲ | fooker an hour ago | parent | next [-] | ||||||||||||||||
> Why supply underspecified requirements in the first place? Because you'd not want to forever loop outside your home when asked to "while you're out, grab some eggs" :) | |||||||||||||||||
| ▲ | antonvs 2 hours ago | parent | prev [-] | ||||||||||||||||
> Why supply underspecified requirements in the first place? Minimizes effort, is the obvious answer. | |||||||||||||||||
| |||||||||||||||||