usernametaken29 2 hours ago

> The shortest path for the model is to implement it completely

Have you worked with LLMs? “I disabled the test so it’s not run so now all the tests pass” is not a hypothetical; it’s pretty common. LLMs frequently do shortcut learning. Reviews are expensive because you still need to do all the steps yourself to understand whether a shortcut is justified.

nomel 2 hours ago

> “I disabled the test so it’s not run so now all the tests pass”

Also:

"I implemented it this terrible way because of precedent in the codebase...that I just wrote"

"I avoided implementing this correctly because of migration concerns for existing installations of this code I'm writing right now"

"I deferred this critical feature for the future, so we can deploy quicker"

or, my favorite,

"I hand-rolled a buggy HTTP server because you said the tool should be self-contained"

bot403 an hour ago

Or, bizarrely, Claude asks about code churn like it matters.

"Human, do you want me to do it the right way? It will cause code churn in 90 files. Or I can take a shortcut and edit 3 files in a terrible way."

Edits 90 files for 12 lines each in 25 seconds...

tobyhinloopen 43 minutes ago

“This is a significant rewrite that will take weeks”

Done after my potty break

bcrosby95 an hour ago

I used Fable to write a relatively small RPG. In the span of 2 hours it managed to do many interesting things. My favorite was when it wrote code with a race condition that could cause people to take more damage than they should, which it then defended as an acceptable tradeoff for parallelism.
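
For illustration only (the actual game code isn't shown here, so everything below is hypothetical): a minimal Python sketch of how damage could be applied from several threads without that kind of race. The `Character` class, `take_damage`, and `run_hits` are made-up names, and the lock is just one common way to make the read-modify-write on hit points atomic.

```python
import threading


class Character:
    """Hypothetical RPG character whose hit points can be changed from many threads."""

    def __init__(self, hp: int) -> None:
        self.hp = hp
        self._lock = threading.Lock()

    def take_damage(self, amount: int) -> None:
        # Reading hp, subtracting, and writing it back happen under one lock,
        # so two concurrent hits cannot interleave between the read and the write.
        with self._lock:
            self.hp = max(0, self.hp - amount)


def run_hits(character: Character, hits: int, amount: int) -> int:
    """Apply `hits` concurrent hits of `amount` damage and return the remaining hp."""
    threads = [
        threading.Thread(target=character.take_damage, args=(amount,))
        for _ in range(hits)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return character.hp
```

With the lock in place, 100 concurrent hits of 3 damage against 1000 hp should always leave exactly 700. Without it, the result would depend on how the threads interleave, which is the sort of "tradeoff" that is hard to defend in game logic.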