Remix.run Logo
oofbey 2 days ago

Oh I’ve had agents remove tests plenty of times. Or cripple the tests so they pass but are useless - more common and harder to prompt against.

maddmann 2 days ago | parent [-]

Ah true, that also can happen — in aggregate I think models will tend to expand codebases versus contract. Though, this is anecdotal and probably is something ai labs and coding agent companies are looking at now.

oofbey 2 days ago | parent [-]

It’s the same bias for action which makes them code up a change when you genuinely are just asking a question about something. They really want to write code.