Remix.run Logo
aidenn0 9 hours ago

I first encountered the following concept in one of Oxide's publications; good chance it didn't originate there though:

There is an implicit social contract with writing that the writer has put more effort into writing than the reader will need to read something. Sure you get crackpots still, but there are only so many Gene Rays in this world, so the volume is limited.

I think the same applies to PRs. Pre-AI , it was usually obvious when a PR was either completely terrible or very half-baked, and the required effort to create even a shitty PR was usually more than that required to reject it.

AI makes it trivial to make a completely terrible PR, and much easier to make a not-immediately-obviously-bad PR.

toponijo 8 hours ago | parent | next [-]

Given this, you can conclude that writers should be putting in at least at much effort as readers, whether or not they use an LLM. What really seems to be the problem is writers that don't at least check their own work, and pass that burden onto the readers. This is easier than ever with LLMs.

This is toxic behavior that unfortunately rewards a selfish writer. I'm worried the AI push incentivizes this too much, to where in corporate situations a reader can't say no to doing work for a selfish writer.

kentm 8 hours ago | parent | next [-]

Its exactly this. I have had a few LLM coding sessions where I reviewed the resulting work and thought "I don't think my team can safely PR this." I then went back and broke it down into smaller PRs, still using LLMs but at a size that is easy to review. And I reviewed the output myself before I asked a reviewer to commit their time.

The problem is that this is increasingly seen as a non-productive workflow slowing everyone else down, so the pressure is growing for writers to just shove massive PRs out the door and reviewers to use LLMs to make that tractable. I suppose those advocates have more faith in LLM output compared to humans than I do.

greiskul 6 hours ago | parent | next [-]

Thats the thing with giant PRs. They never really needed to be reviewed anyway. In cultures with strong review culture I have worked at, if you send me a thousand line PR and ask me to review it, I will look at the giant blob of text, and immediately fire off a "it's too long, can you cut it into smaller PRs?".

Because I don't trust myself to review a giant PR. It takes too much cognition to properly review it.

And now that people are making PRs with AI, this is even more important. If the AI was good enough to have coded it, please instruct it to make the changes in reviewable chunks.

majormajor 6 hours ago | parent | prev | next [-]

> the pressure is growing for writers to just shove massive PRs out the door and reviewers to use LLMs to make that tractable

Even in these move-fast envs, it should be reasonably apparent for people to realize that the author should be using the LLM to make the PR tractable, not solely using the LLM to shovel out a giant PR + slop PR description.

And the LLMs can often do this - if you ask to restructure or break up a big change differently, they can often make quite reasonable suggestions and help with it. That's just not what you're gonna get if you're lazy. If you want a small LLM-generated change, often you have to start with a big one then ask it to figure out what it can get rid of, since many times it doesn't have perfect model of all the code in it's "head" before it starts spitting stuff out. The big companies have been doing their best to automate this for the last couple of years vs the even-more-blind attempts you used to get, but there's still the issue of the models+tools following generic advice aimed at median codebases vs being intimately familiar with this codebase.

You can go fast without being lazy. And when going fast, in some ways, it's more important than ever to put in that effort to not blowing things up.

kentm 6 hours ago | parent [-]

It should be but often isn't. There's been a lot of threads on HN where the response to huge PRs wasn't "Don't do that, use AI when authoring better" but "The reviewers are actually the problem, they're missing the AI train". And I see this in industry too.

gedy 7 hours ago | parent | prev [-]

> I suppose those advocates have more faith in LLM output compared to humans than I do.

Some of this is the funny situation where the faithful will state: "This writes better code than I do!" and miss the irony of: "yes, yes it does"

themgt 7 hours ago | parent | next [-]

Some of this is the funny situation where the faithful will state: "This writes better code than I do!" and miss the irony of: "yes, yes it does"

"Blessed are the humble ..."

ErroneousBosh 7 hours ago | parent | prev [-]

> "This writes better code than I do!" and miss the irony of: "yes, yes it does"

I guess it depends on what you consider "better". I've tried using LLMs to write code over the past couple of weeks with extremely mixed results.

The LLM certainly writes more interesting code! They like their cute ASCII/unicode animations, don't they?

It definitely writes a lot more code, none of it actually correct but some of it functionally similar to correct code.

If you like lots of code then I guess that's better. I like less code.

kentm 6 hours ago | parent | next [-]

I find it can often write correct code but not maintainable, performant, or reviewable code without additional human guidance. The "solution" frequently given is that humans don't need to maintain it anymore so its not actually a problem. But the agent can't be accountable for mistakes, so unless that changes or the risk of a defect is close to zero, one still has to put forth effort to keep the code maintainable.

To be fair, there are plenty of situations where throwaway code is perfectly fine and/or defect risks are low enough to make the trade-off worth it. I don't think a lot of developers are thinking about it in that context, though.

(No unit tests aren't enough)

TylerE 4 hours ago | parent | prev | next [-]

> They like their cute ASCII/unicode animations, don't they?

One of the few global Cluade directives I have setup is to never use emojis - and it never has, either in chat output or in code. Don't blame the tool when you don't spend 30 seconds configuring it. It's even easier with AI since you don't have to go digging for some obscure .vimrc snippet - it's literally just plain English.

gedy 6 hours ago | parent | prev [-]

Yes I basically meant those folks weren't very good developers to begin with and now extrapolating to: "wow this is better than all devs!", when it's more like "it's you, dude"

ErroneousBosh 6 hours ago | parent [-]

This sounds awfully like the people who think that self-driving cars and even auto-braking systems will eliminate all accidents, because everyone else is as bad a driver as they are.

ryandrake 2 hours ago | parent [-]

Someone pointed out[1] a while ago that LLMs look good at things you are bad at. Which is I think one of the best explanations of why so many people disagree about how good they are at programming. There are a lot of people really bad at programming, and they will look at the output if an LLM and say “Wow, it’s so much better than my code!”

1: https://news.ycombinator.com/item?id=48315309

RealityVoid 6 hours ago | parent | prev | next [-]

I've had a guy once reply in an email with a bug report generated by ChatGPT telling me that some piece of software I wrote wasn't working. He just plopped right there the discussion he had with the confirmation bias machine confirming 100% that what he had in front was sending spurious messages. With all the information in the world at its disposal, the LLM did not consider informing the reporter that maybe his code should flush the serial device pipe before starting his processing. I stopped short from facetiously replying to him that maybe he should use another model, since his seems to be broken.

cgio 3 hours ago | parent | prev | next [-]

I think that this is the essence of the argument. I spent two weeks of long nights across a few different sessions with millions of tokens generated, to produce a 5 page proof. I think we have come to the age of the aesthetics of curation. At least I don’t feel like I broke this silent contract. The effort you put in driving the torrent of words before distilling it is a new art. First time I tried it and it felt more creative than slop. The judgement nevertheless lies on the eye of the beholder.

XorNot 7 hours ago | parent | prev [-]

Isn't this just a continuation of the performance art of the modern corporate environment though? There's an entire industry producing pages of documents which aren't read, aren't responded to, but need to be at least X lines long for anyone to take them "seriously".

Then suddenly LLMs happened and it's like the mask is off: no one's reading them still, but also no one is writing them either.

Which is perhaps a drop in the ocean of the insanity which is "we need you to work on the Jira tasks" as basically a job title.

inigyou 6 hours ago | parent [-]

You're absolutely right! The modern commercial sector has been writing bullshit, bullshit, and more bullshit, and become completely disconnected from the actual outputs of its work. And it has to be, because if only useful work was done, two thirds of the population would be unemployed without benefits and would revolt so they didn't starve.

alexanderdmitri 42 minutes ago | parent | prev | next [-]

I think this is from Sartre's "Literature & Existentialism."

I'm not a big fan of his generally, but I highly recommend this book in particular. A lot of what he wrote there really resonated for me.

cyanydeez 6 hours ago | parent | prev | next [-]

if anyones got an active issues list, the maintainers should close all issues and open only ones they intend to fix. A bot should repeat this message to the issuer and point them to issues they're accepting.

So at a minimum, you could maybe fish some useful work out. PRs should be the same.

ErroneousBosh 7 hours ago | parent | prev [-]

> Sure you get crackpots still,

They've still put more effort into writing their crackpottery than you will put into reading it, and at worst it's entertaining. The late Ivor Catt's articles on "the death of electric current" - where he expounds the idea that current and indeed electric charge does not exist, because of stuff involving Maxwell's equations where the maths looks about right to me but I'm not a good enough mathematician to prove - were pretty damn odd, but his writing in 1989 on how it would be vital for an interconnected network of computers for information sharing to treat censorship as damage and route around it and some ideas for doing this was bang on the money (as we now see) and his writings on how American business management methods result in the worst possible outcome for everyone that's not already a billionaire have also proven oddly prophetic.

So maybe there's something in the crackpots after all.

aidenn0 7 hours ago | parent [-]

Maybe lots of (most?) people are crackpots about something, but they lack the time and/or resolve to do something about it.

ErroneousBosh 6 hours ago | parent [-]

I wonder what I'm crackpotty about? Forth, probably, although I did actually port Dave Dunfield's 6809 Forth to a mid-1980s sampler. It boots off a floppy, I know what the memory map is, if it boots off a floppy I can make it run anything, right?

How about this crackpot view? Perl vs. Python, which I guess has been replaced by Rust vs. Go - I prefer Python and Go to Perl and Rust, simply because I know them better. If you want me to work in Rust or Perl I don't really care, they're just languages. I'm not as proficient in them, expect it to take longer. Rewrite it all in Rust? Sure. I'd prefer not to, but if that's today's project then shut up and pay my invoice.

Let's see, what other things are wild crackpot ideas around here?

I don't think LLMs are very good.

I don't think self-driving cars solve the right problem.

Permaculture would be better for long-term ecological sustainability and food security than "everyone should be vegan".

Bikes would work perfectly well in American cities if you used enough of them.