Remix.run Logo
ary 3 hours ago

There are a number of comments here where people open up about their contrasting experiences of not being a part of a programming community. Those are well addressed, I think, but there is another point to consider.

We need to remember the people, that we may never talk to, that are downstream of all of this software. Not necessarily “the users” as there are many pieces of software meant for other devs, but I think the users deserve consideration nonetheless.

Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world. This is on top of all of the problems that existed before LLMs, like human error and perverse financial incentives. Shipping poor quality and user hostile software actually hurts people. Real people. Harm is caused in both big and little ways to living, breathing actual people. This “inevitable” slide into generative AI harms every single person it comes into contact with. The devs, the users, the investors, everyone. Those harms may happen at different times and in different ways and the creeping nature of it all might make it easier to ignore, but it’s happening.

“AI” is a blight. You can leave me behind as well.

unknownfuture 2 hours ago | parent | next [-]

> Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world.

I genuinely don't know if that's true and I doubt you do, either. It's all feels right now.

What I do know is I run a couple of personal projects and I can say they are of objectively higher quality now that I'm using AI to build out proper CI pipelines, expand test coverage, produce higher quality architectures, etc.

Why?

Because in the past I didn't have the capacity to invest in that kind of hardening, but with AI, now I do.

Of course you'll probably make the claim that my code is probably crap, the tests suck, etc, because you've already made up your mind. But having been in the industry for 25 years, I can tell you definitively that you'd be wrong about that.

Now, what'll happen to the median codebase? God only knows. Maybe I'm especially diligent.

But given we're really only 6-12 months into the agentic coding era, I think the only conclusion you can make is that the jury is still out.

daveidol 2 hours ago | parent | next [-]

Well said. People love to make everything black and white / good and bad. But things are rarely that simple.

unknownfuture 2 hours ago | parent [-]

And I get that compulsion to boil this mess down into a simple good/bad dichotomy.

I absolutely have deeply mixed feelings about these tools, the ethics associated with them, the impact on the industry, on the talent pipeline, etc.

But I also can't deny that they are incredibly powerful tools that are here to stay in one form or another.

And I say that as someone who, a year ago, was absolutely convinced that they were incremental at best and scoffed at everyone who said something like "yeah but they're so much better now!" or "they're only going to get better!"

Well, they were right, they did, and the world has changed. AI generated code is landing in the Linux kernel. 250+ security holes were found and fixed in Firefox. The impact is here and now, and it's mixed and ugly and complicated.

dleeftink an hour ago | parent | prev | next [-]

We would need a lot of anecdata. Would we trust a doctor/pilot/engineer of 25 years more with the tried and tested (but now aging) kit of their time, or the acolyte that has been fully absorbed by the latest and greatest and has now more than doubled their productivity?

While code quality may go up on a case by case basis, we should be mindful whether we are comparing it to our own personal baselines, or to the average code quality across the board. I.e. will the baseline regress to the mean, or raise the floor for the average coder?

prerok an hour ago | parent | prev | next [-]

Even if the jury is still out, I would still say you are both right already.

The amount of slop produced even in company setting is staggering and I don't like it one bit that neither the submitter nor the reviewer of the PR paid due dilligence. And I am only complaining because it then becomes my problem. So, then I have to start nagging people to clean that up. I can say with 100% certainty that the problems I face now would not have happened without LLMs.

That said, used with care, with proper supervision, with dilligence to review what LLMs did, I still think they can be and are beneficial.

I think that we are just not used to getting results of questionable quality from the tools we use. So, I am hopeful that we will learn and it will improve with time but still find myself dreading the age of the vibe coder.

unknownfuture an hour ago | parent [-]

Very well said.

I also think reading and reviewing code is a skill that connected to but very much independent of the writing of code, and the use of coding agents requires us to be far more skilled and diligent at it.

So put another way, people who were good at coding without agents may in fact be a poor fit with them, which means the entire industry is experiencing a dislocation between skills we have and skills we need, leading to extremely bimodal outcomes.

prerok an hour ago | parent [-]

Indeed, however I would also point out that senior engineers have already been expected to be good at reading code: they were expected to evaluate the code quality of other contributors, so they had to be able to do that.

In fact, from my personal experience, going from junior to mid to senior, that was the hardest thing. Reading the code and thinking if what they did was really correct and will not have additional undesired side-effects was hard to become efficient at (it didn't help that we were working in C back then).

So, really, I think that for juniors it's actually much harder because if they want to do due dilligence they have to do the same evaluation but without the years of experience working with that code base. I can understand, even if I don't like it, that they just submit the output of the LLM for the senior to review.

unknownfuture 10 minutes ago | parent [-]

> Indeed, however I would also point out that senior engineers have already been expected to be good at reading code: they were expected to evaluate the code quality of other contributors, so they had to be able to do that.

Yeah, but the frequency, volume, and complexity of that activity, and its ratio versus all the other work that a developer was previously expected to do, has shifted dramatically, not least because now we're having to review the output of our own coding agents as well as that of other developers on our teams.

As a consequence, folks who were marginal but capable at that skill now likely find themselves working beyond their ability.

> So, really, I think that for juniors it's actually much harder because if they want to do due dilligence they have to do the same evaluation but without the years of experience working with that code base. I can understand, even if I don't like it, that they just submit the output of the LLM for the senior to review.

Yup, couldn't agree more.

sillyfluke an hour ago | parent | prev [-]

It's a bit weird when the article in question is predominantly about software development in a professional setting and the top comment is about how some people in thread are disregarding this context and opining unrelatedly about their unique solo development or personal project development experiences, to then respond to said comment by insistently going on about how AI is great for your personal projects, when people are unable to assess the value of your AI-assisted personal projects and whether they would concur with the high opinion you have of them. A turd with a CI pipeline is still a turd, I think we can all agree on that. IF someone said AI is great because they can now expand test coverage and build a CI pipeline for their todo app in rust, it wouldn't exactly be the proof you're looking for I don't think.

But I agree fully with your last paragraph, and said something similar in a comment elsewhere where I stated my tangible bar as being a Ladybird like browser built from scratch achieving Chrome parity in six months while doing continuous stable releases with coding agents in tow. Otherwise, as you said, the jury is still out.

MarkSweep 2 hours ago | parent | prev | next [-]

I suspect you are right that LLM-generated software will likely negatively impact people's lives. The flip side of this is there is going to be a lot of software generated that would have never been possible before. And for some use cases, some crappy software is better than no software. I think it's hard to predict whether on net this will be a good or bad thing.

20k 2 hours ago | parent [-]

>And for some use cases, some crappy software is better than no software

The best use case i've seen for AI is people generating random one shot projects for themselves, which is honestly so cool. You can make some basic app that does something very specific, that would have taken objectively a lot longer to make by hand. This is when 'crappy' software is more than good enough for a specific problem

castedo 2 hours ago | parent [-]

Similar use cases of useful AI one-shot projects are: demos, proof-of-concept, prototype, exploratory tests, etc... I've AI-generated an HTML/javascript craplet to test how browser security would behave for certain JavaScript API calls. I just wanted to confirm how browser security would behave before I started to spend time hand-coding the non-AI quality software that I wanted to release.

And I say this as somebody who generally has a low opinion of AI generated code and rarely uses it. But it has its place.

nradov 2 hours ago | parent | prev | next [-]

Nah, there's no evidence of reduced quality. If anything it's the reverse. I've seen AI code review tools be tremendously effective at catching defects which otherwise would have shipped.

philipp-gayret an hour ago | parent | next [-]

In the DORA group's reporting on AI-assisted software engineering, they indeed state that across industries, quality goes UP with the use of AI assistants like Claude and others.

Moreover, in my experience helping businesses on this topic; They never defined or made measurable what quality meant in the first place. Then when they finally do figure it out, it turns out that the average repository is a total disappointment in terms of absolute quality.

prerok an hour ago | parent | prev | next [-]

Same for me. I have not yet successfully used an LLM to generate code to produce a feature, though to be fair, it might just be because I don't have the patience to go back and forth with it (will readily admit that I am using it wrong).

Also, while some code review comments are just plain wrong, LLMs did produce some damn good comments, on the same level as a different senior engineer might note had they taken the time to study the code carefully.

jimmaswell an hour ago | parent | prev [-]

Same here, it's been catching a lot of bugs that would have been very hard to trace or discover, often just in the process of doing something else.

simonw 2 hours ago | parent | prev | next [-]

> Shipping poor quality and user hostile software actually hurts people. Real people.

I couldn't agree more. If you're using AI tools to produce worse software, faster, you should rethink how you are using them.

If we're not delivering better software with this stuff then what are we using it for?

aspenmartin 3 hours ago | parent | prev | next [-]

> Handing over software quality to the stochastic code extruder is causing a sharp drop in the quality of software put out into the world.

Well, first of all you and the author point to the same derisive comment of these models being, in your words "stochastic code extruder" or the one I have heard a lot "next-token predictors", and the connotation I read from these being that this makes them inherently dumb or unintelligent and I don't understand that. The fact that these "stochastic code extruders" can solve Erdos problems is sort of the proof in the pudding. Next token prediction is profound in that it is _a very simple objective_ yet it is _enough_ to take you to extraordinary heights.

Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM. Do you really think humans are universally better and produce universally higher quality code? Not even universally, I would say _typically_. I would trust an LLM to not write a buffer overflow far more than a junior or a mediocre senior engineer. LLMs have built things in my domain that are non-trivial and impressive and correct.

Not to mention, these systems are following a predictable trend in performance improvement so these worries about quality just won't age well, and it seems to be a head-in-the-sand attitude that pretends like quality and reliability are not getting very very good _already_.

> Shipping poor quality and user hostile software actually hurts people.

Could not agree more. So why do you think humans are inherently better at this?

> This “inevitable” slide into generative AI harms every single person it comes into contact with.

I just don't quite understand this, is it that: (1) agentic code is inherently inferior to human code and thus (2) shipping agentic code is actively harmful?

castedo 10 minutes ago | parent | next [-]

> The fact that these "stochastic code extruders" can solve Erdos problems is sort of the proof in the pudding.

This claim is very misleading and not really true. It reflects the kind of exaggeration and spin made by corporate marketing. I would not call this a fact at all. Like many claims made by for-profit marketing, if one looks into the details and think critically about what is being claimed, one can see that consumers are jumping to false conclusions.

That said, it is very cool how an LLM helped human mathematicians in the recent specific Erdos problem solution announced by OpenAI. Just don't jump to the conclusion that anybody can input any Erdos problem into an LLM and a solution will come out the other end.

nine_k 2 hours ago | parent | prev | next [-]

It's like people complaining about "poor quality plastic trinkets" that replaced well-made household items. Of course high-quality things can be (and are) made of plastic. The problem is that you can still make a very cheap passable thing out of plastic, that would be uneconomical to make out of metal or wood.

Same with code: by using AI, one can produce passable software trinkets very cheaply, that would be uneconimical to produce by paying poor-quality human developers.

The floor has moved downwards, allowing to produce a flood of new, trash-quality, disposable code very cheaply. It does not mean that we'll have to use only that code. But unfortunately we'll have to live with it, too.

customguy an hour ago | parent [-]

> It's like people complaining about "poor quality plastic trinkets" that replaced well-made household items.

Actually, people had to be made to see it as cheap so they would throw away and re-buy more.

https://desis.osu.edu/seniorthesis/index.php/2024/09/15/crea...

> “It was a really difficult sell to the American public in the post-war period, to inculcate people into a throwaway living,” she says. “That is not what people were used to.”

> A solution companies came up with was emphasizing that plastic was a low-cost, abundant material.

> A 1960 marketing study for Scott Cup said the containers were “almost indestructible,” but that the manufacturer could still convince people to discard them after a few uses. To counter any “pangs of conscience” consumers might feel about throwing them away, the researchers suggested a “direct attack”: Tell people the cups are cheap, they said, and that “there are more where these came from.”

> A few years later, Scott ran an advertisement saying its plastic cups were available at “‘toss-away prices.”

It wasn't plastic itself, and likewise it's not "AI" itself.

We do have an abysmal track record as industrialized nations however, and more recently, in many parts of the software industry.

But we can change it. With so many things, tech people spent so much time and energy debating... like cookies or HTTPS or whatever... we often heard/said that while we care so much about doing the right thing, we can't achieve anything at work because consumers don't care. Well, this time, pretty much all of the world cares a lot. I mean, the Vatican just blogged about it!

Maybe we just "have to live with it", but in that case, there is also no utility in pointing that out, since we literally have to live with it. And of course, it's really about the shape of the "it", and how it's used, not that there is one that will never go away. That is also true about most things: stuff we don't currently use is in the museum or text books. Nothing goes away away, but we no longer drink out of lead cups, even though we still use lead. We don't have x-ray machines in shoe shops, even though we still use x-rays.

nine_k 33 minutes ago | parent [-]

I agree, "AI" is being pushed beyond reason, in an attempt to make it look like what it's not, or at least not yet. And yes, people do act irrationally due to that, see all that tkenmaxxing and layoffs. I don't think it will last in this inebriated state forever.

But it's worth noting that there's a new substance enabled by AI, the "slop", that is flushing violently into our world right now. Pretending it's not there, and that we won't see more of it, is perilous.

nomel 37 minutes ago | parent | prev | next [-]

> Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM.

Once you step out of pure-software orgs, it becomes clear that most would benefit from having AI write code. There's a huge moat between most people and the point where they can afford/find the effort of someone that can write software.

These people, that only care about practical results rather than somewhat tangential things like "elegance" and "maintainability", are going to benefit tremendously.

mbgerring 2 hours ago | parent | prev | next [-]

>> Shipping poor quality and user hostile software actually hurts people.

> Could not agree more. So why do you think humans are inherently better at this?

Because humans are capable of empathy

aspenmartin 2 hours ago | parent | next [-]

Why is that a prerequisite? There are entire philosophies about what makes good design for UI's etc. Why can't models figure this out? Why do you feel this is some sort of mystical thing out of reach?

unknownfuture 2 hours ago | parent | prev [-]

Given the UIs I've experienced over the years, I'd dispute that assumption...

hansmayer 2 hours ago | parent | prev [-]

> to the same derisive comment of these models being, in your words "stochastic code extruder"

So many excited and insulted LLM adopters on this thread. There is nothing derisive in that comment, it is simply the purest possible definition of how they work. Stochastics is a branch of maths you know.

> can solve Erdos problems is sort of the proof in the pudding

For the non-engineer, non-mathematician it may sound authoritative, but you'd probably be surprised to learn that most of Erdos problems are not at all complex - they are just not very interesting or relevant. So it is a proof in the pudding, provided the pudding is made of shit - the kind of stuff LLMs produce most of the time.

> I just don't quite understand this, is it that: (1) agentic code is inherently inferior to human code and thus (2) shipping agentic code is actively harmful?

Yes and yes - have you not heard of that AWS incident with Kiro when the "agentic" shit deleted an entire infrastructure environment, complete with data, config, etc.?

> Also I wonder how many folks honestly look in the mirror and think: how does the median programmer differ from an LLM

Apart from the obvious absurdity of this statement - I know a lot of you non-engineer types feel "empowered" by the LLMs, in the sense of how they immediately seem a genius when you ask them on a topic you are not expert in, but you may want to read a book on programming first - maybe you'll get a clue then.

aspenmartin 2 hours ago | parent [-]

> So many excited and insulted LLM adopters on this thread.

neither excited nor insulted.

> There is nothing derisive in that comment, it is simply the purest possible definition of how they work. Stochastics is a branch of maths you know.

Not sure what you mean by stochastics but this is more statistics. They are trained with a next token loss, that doesn't belie "how they work".

> For the non-engineer, non-mathematician it may sound authoritative, but you'd probably be surprised to learn that most of Erdos problems are not at all complex - they are just not very interesting or relevant.

It sounds like you are both an engineer and a mathematician? Can you confirm? These are problems unsolved for many years. You think no good mathematicians have taken a stab at them, even if just to say they have resolved an unsolved Erdos problem? They are "not at all complex" is quite an extraordinary thing to say I'm wondering if you actually do have the pedigree you are trying to make it sound like you have, or if you are just regurgitating the same HN talking points you've heard.

> Yes and yes - have you not heard of that AWS incident with Kiro when the "agentic" shit deleted an entire infrastructure environment, complete with data, config, etc.?

And this means agentic code is inherently inferior to human code? Howso?

> Apart from the obvious absurdity of this statement - I know a lot of you non-engineer types feel "empowered" by the LLMs, in the sense of how they immediately seem a genius when you ask them on a topic you are not expert in, but you may want to read a book on programming first - maybe you'll get a clue then.

in the beginning you mentioned there were a lot of "excited and insulted LLM adopters" and yet...this sounds quite excited and defensive. Believe it or not, I am not a "non-engineer type" and its telling you assume that people who don't seem to share the same opinion as you are somehow less qualified than I assume you think you are? Why is this statement obviously absurd. Maybe you work in a really talented engineering team, which kudos to you I also have worked in teams like this, and I have also seen what is the p50 engineer and they are just as error prone or more than Claude. Thank you for the advice to read a book on programming as if that somehow would have any bearing on this at all?

hansmayer an hour ago | parent [-]

> an engineer and a mathematician

An engineer with an engineering degree, which as it may still be known to some, requires a fairly stringent mathematical underpinning. So yes, I know a thing or two - read up on Erdos and his problems, I am not here to enlighten every vibecoding PM that shows up.

> And this means agentic code is inherently inferior to human code? Howso?

Again, I am not here to explain the world to some clueless PM. You have your LLMs for that :) But for the sake of bringing you closer, the "agentic" code is often very inferior, implementing happy paths or just bluntly exposing secrets in clear texts, etc. Probably a consequence of it being trained on, as you put it "p50 engineering code".

> Maybe you work in a really talented engineering team,

Running my own company and been paying the LLM-Shit-Generators for my whole team for a long time, in the hope they would bring the advertised benefits. Guess what - for serious use-cases, they bring shit and more shit.

> Thank you for the advice to read a book on programming as if that somehow would have any bearing on this at all?

Oh yeah obviously not, I mean, its not like understanding software development would help you understand how LLMs are not similar to a "p50 engineer" at all:). I'd take the latter over the former every time.

> Why is this statement obviously absurd

Well for one, LLMs are not humans, but it should be obvious to even to most cretinous of the e/acc crowd. It's not like they can think in abstract terms or come up with completely new concepts. But then again, don't mind me - if you can live with below average AI slop - go for it.

martin-t an hour ago | parent | prev | next [-]

Steve Jobs motivated Larry Kenyon, the engineer working on bootup, by saying he is effectively saving lives.

I wonder if we reverse that. When engineers write shitty, slow, buggy code or when they're forced to do that by the company, they are effectively killing people.

After all, if I spend an hour dealing with a preventable bug, that's an hour of my live I am never getting back. Multiple that by the userbase and you get entire lifetimes.

[0]: https://www.folklore.org/Saving_Lives.html

idiotsecant 2 hours ago | parent | prev [-]

I think history will prove that this is a less nuanced view than is required to accurately describe the situation. Abandoning human agency through the use of generative AI harms us all. Using AI as a force multiplier to implement human agency helps us all. It's possible to recognize that asking AI to do everything results in a poor product and brain rot for the humans. It's not at all clear that this is the case for using AI to build boilerplate, help with tests, etc.

prerok an hour ago | parent [-]

I've always felt we somewhat failed as engineers (I included, of course) when I was doing boilerplate by hand. We should have taken the time to get those autogenerated. But... it takes time and the generator is always more complex than the produced code, sometimes even by a few factors, so it also takes an expert to maintain.

As for tests, I've seen LLMs produce good ones as well as useless ones. I guess it's all about instructions... sorry, prompts... no, sorry, prompt engineering to get it right and done properly.

That said, I am also very concerned with brain rot. Engineers nowadays can commit code they don't understand without a blink of the eye. Slowly, the knowledge may get sparse if we are not careful about it.