einpoklum 8 hours ago

> Today’s coding benchmarks have established that models can write correct code.

I wouldn't say that.

> But as AI-generated code becomes the dominant path to production

I really hope that's not the case.

zakisaad 8 hours ago

How do you define "correct" code?

newsicanuse 6 hours ago

Code that gets stuff done instead of beating around the bush and making unexpected errors.

vanuatu an hour ago

i suspect this is highly dependent on what you're working on

from my experience, if you give the models a way to self-verify correctness, they succeed basically 100% of the time
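
For illustration only, here is a rough sketch of what "a way to self-verify" could look like: generate a candidate, run a check against it, and feed any failure back into the next attempt. Every name here (`solve_with_self_check`, `verify_add`, `fake_model`) is hypothetical, and `fake_model` is a hard-coded stand-in for a real model call, so this shows the shape of the loop rather than any particular tool's behavior.

```python
from typing import Callable, Optional


def solve_with_self_check(
    generate: Callable[[str], str],
    verify: Callable[[str], Optional[str]],
    task: str,
    max_attempts: int = 3,
) -> Optional[str]:
    """Ask for a candidate, check it, and retry with the failure as feedback."""
    prompt = task
    for _ in range(max_attempts):
        candidate = generate(prompt)
        error = verify(candidate)  # None means the check passed
        if error is None:
            return candidate
        # Include the failure so the next attempt can react to it.
        prompt = f"{task}\n\nPrevious attempt failed: {error}"
    return None  # no candidate passed within the attempt budget


def verify_add(source: str) -> Optional[str]:
    """Hypothetical check: run the candidate and test add(2, 3) == 5."""
    namespace: dict = {}
    try:
        exec(source, namespace)  # only acceptable for trusted or sandboxed code
        if namespace["add"](2, 3) != 5:
            return "add(2, 3) did not return 5"
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return None


def fake_model(prompt: str) -> str:
    """Stand-in for a model: wrong at first, correct once it sees feedback."""
    if "failed" in prompt:
        return "def add(a, b):\n    return a + b"
    return "def add(a, b):\n    return a - b"
```

Calling `solve_with_self_check(fake_model, verify_add, "Write add(a, b)")` returns the corrected source on the second attempt. The loop is only as good as `verify`: a candidate that passes a weak check can still be wrong, which is likely where the "depends on what you're working on" caveat comes in.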