| ▲ | gymbeaux 5 hours ago | |||||||||||||||||||||||||||||||
Those numbers are abysmal. Should we really be using LLMs to write our code? I have a theory- LLMs can spit out code that gets the job done and looks ok, maybe even great, but contains small “anomalies” that compound over time. An enterprise app developed entirely with LLM-happy devs might end up virtually unmaintainable. I’m not sure how to explain it, but the more I see LLM-written code the more I feel it’s bad code doing a good job of masquerading as good code. I think this take will become less-hot in the next year or two when we see enterprise greenfield projects that were created entirely with LLM “assistance” go to prod. I think we’ll find that the code is difficult for humans to read, understand, debug, and extend- and I think the larger the codebase the harder it will be for LLMs to maintain. More opportunity for hallucination, larger context windows needed, more tokens bought and spent for smaller and smaller code changes. I think the more code an LLM writes for an app, the worse that codebase becomes. | ||||||||||||||||||||||||||||||||
| ▲ | andybak 20 minutes ago | parent | next [-] | |||||||||||||||||||||||||||||||
I can't help but feel that people continually underestimate how bad human written code becomes over time. The exception is probably single-person passion projects or open source projects that maintain quality governance over time. I strongly suspect most closed source code developed under commercial or internal pressure is pretty awful after a few years of development. All LLM code has to do is suck less than existing code. And that's presuming the code quality doesn't improve as the models, the harnesses and our ways of working with them improve. | ||||||||||||||||||||||||||||||||
| ▲ | realaleris149 an hour ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Take a look at a sufficiently old random internal repo which was not written with LLMs and compare. My observation is that they are equally bad and hard to maintain or even more so than the new ones. One thing I’ve noticed is that the LLM assisted ones have a lot more comments which is nice but take more time to read. | ||||||||||||||||||||||||||||||||
| ▲ | xvinci 2 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
Not my observation. If you never look at the code and dont have basic guardrails in place (linters, architecture tests, some guidelines for best practices) - probably. But as soon as you do minimal reviews and high-level corrections, applications turn out just fine. Can there be bugs? Sure. That's the price of not reading or understanding every line. It should depend on the criticality of your software how much of these you tolerate and how much you don't (reviewing, understanding, testing everything 100% like you were used to if you had written it yourself will kill most if not all of your gained speed) But I never got the impression of unmaintainability or unfixable bugs. Actually the other side around: A really good cleanup pass, architectural changes, or bugfixes are seldom more than a few prompts and 2 hours away, provided your overall base is decent and you actually gave a fuck from the start. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | realusername 3 hours ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||
> code that gets the job done and looks ok, maybe even great, but contains small “anomalies” that compound over time They clearly are only assistants for the moment, you can use them to do work ... but only if you could do the said work yourself alone in the first place. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||
| ▲ | Foobar8568 3 hours ago | parent | prev [-] | |||||||||||||||||||||||||||||||
Have you worked with enterprise apps? The ones I have used for decades are hot garbages. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||