| ▲ | aeldidi 3 hours ago | ||||||||||||||||||||||||||||
There's an odd trend with these sorts of posts where the author claims to have had some transformative change in their workflow brought upon by LLM coding tools, but also seemingly has nothing to show for it. To me, using the most recent ChatGPT Codex (5.3 on "Extra High" reasoning), it's incredibly obvious that while these tools are surprisingly good at doing repetitive or locally-scoped tasks, they immediately fall apart when faced with the types of things that are actually difficult in software development and require non-trivial amounts of guidance and hand-holding to get things right. This can still be useful, but is a far cry from what seems to be the online discourse right now. As a real world example, I was told to evaluate Claude Code and ChatGPT codex at my current job since my boss had heard about them and wanted to know what it would mean for our operations. Our main environment is a C# and Typescript monorepo with 2 products being developed, and even with a pretty extensive test suite and a nearly 100 line "AGENTS.md" file, all models I tried basically fail or try to shortcut nearly every task I give it, even when using "plan mode" to give it time to come up with a plan before starting. To be fair, I was able to get it to work pretty well after giving it extremely detailed instructions and monitoring the "thinking" output and stopping it when I see something wrong there to correct it, but at that point I felt silly for spending all that effort just driving the bot instead of doing it myself. It almost feels like this is some "open secret" which we're all pretending isn't the case too, since if it were really as good as a lot of people are saying there should be a massive increase in the number of high quality projects/products being developed. I don't mean to sound dismissive, but I really do feel like I'm going crazy here. | |||||||||||||||||||||||||||||
| ▲ | RealityVoid 2 hours ago | parent | next [-] | ||||||||||||||||||||||||||||
You're not going crazy. That is what I see as well. But, I do think there is value in: - driving the LLM instead of doing it yourself. - sometimes I just can't get the activation energy and the LLM is always ready to go so it gives me a kickstart - doing things you normally don't know. I learned a lot of command like tools and trucks by seeing what Claude does. Doing short scripts for stuff is super useful. Of course, the catch here is if you don't know stuff you can't drive it very well. So you need to use the things in isolation. - exploring alternative solutions. Stuff that by definition you don't know. Of course, some will not work, but it widens your horizon - exploring unfamiliar codebases. It can ingest huge amounts of data so exploration will be faster. (But less comprehensive than if you do it yourself fully) - maintaining change consistency. This I think it's just better than humans. If you have stuff you need to change at 2 or 3 places, you will probably forget. LLM's are better at keeping consistency at details (but not at big picture stuff, interestingly.) | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | FeteCommuniste 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
There's got to be some quantity of astroturfing going on, given the players and the dollar amounts at stake. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | yusufnb 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
At my work I interview a lot of fresh grads and interns. I have been doing that consistently for last 4 years. During the interviews I always ask the candidates to show and tell, share their screen and talk about their projects and work at school and other internships. Since last few months, I have seen a notable difference in the quality and extent of projects these students have been able to accomplish. Every project and website they show looks polished, most of those could be a full startup MVP pre AI days. The bar has clearly been raised way high, very fast with AI. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | mikenew 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Pretty much every software engineer I've talked to sees it more or less like you do, with some amount of variance on exactly where you draw the line of "this is where the value prop of an LLM falls off". I think we're just awash in corporate propaganda and the output of social networks, and "it's good for certain things, mixed for others" is just not very memetic. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | dan-robertson 30 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
I think it’s just very alien in that things which tend to be correlated in humans may not be so correlated in LLMs. So two things that we expect people to be similarly good at end up being very different in an AI. It does also seem to me that there is a lot of variance in skills for prompting/using AI in general (I say this as someone who is not particularly good as far as I’m aware – I’m not trying to keep tips secret from you). And there is also a lot of variance in the ability for an AI to solve problem of equal difficulty for a human. | |||||||||||||||||||||||||||||
| ▲ | nevster 11 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
I think the main thing is, these are all green fields projects. (Note original author talking about executing ideas for projects.) | |||||||||||||||||||||||||||||
| ▲ | kylecazar 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Matches my experience pretty well. FWIW, this is the opinion that I hear most frequently in real life conversation. I only see the magical revelation takes online -- and I see a lot of them. | |||||||||||||||||||||||||||||
| ▲ | peab an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
> ... but also seemingly has nothing to show for it This x1000, I find it so ridiculous. usually when someone hypes it up it's things like, "i have it text my gf good morning every day!!", or "it analyzed every single document on my computer and wrote me a poem!!" | |||||||||||||||||||||||||||||
| ▲ | an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
| [deleted] | |||||||||||||||||||||||||||||
| ▲ | AstroBen an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
We're at the apex of the hype cycle. I think it'll die down in a year and we'll get a better picture of how people have integrated the tools Even if it's not straight astroturfing I think people are wowed and excited and not analyzing it with a clear head | |||||||||||||||||||||||||||||
| ▲ | 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
| [deleted] | |||||||||||||||||||||||||||||
| ▲ | mark_l_watson 36 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
“Emperor wore no clothes” moment. Given time AI will lead to incredible productivity. In the meantime, use as appropriate. | |||||||||||||||||||||||||||||
| ▲ | gchamonlive 31 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
I'm curious what types of tasks you were delegating to the coding agents? | |||||||||||||||||||||||||||||
| ▲ | LogicFailsMe 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
I find these agents incredibly useful for eliminating time spent on writing utility scripts for data analysis or data transformation. But... I like coding, getting relegated to being a manager 100%? Sounds like a prison to me not freedom. That they are so good at the things I like to do the least and still terrible at the things at which I excel. That's just gravy. But I guess this is in line with how most engineers transition to management sometime in their 30s. | |||||||||||||||||||||||||||||
| ▲ | chrisjj 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
> if it were really as good as a lot of people are saying there should be a massive increase in the number of high quality projects/products being developed. The headline gain is speed. Almost no-one's talking about quality - they're moving too fast to notice the lack. | |||||||||||||||||||||||||||||
| ▲ | daliusd 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Maybe it is language specific? Maybe LLMs have a lot of good JavaScript/TypeScript samples for training and it works for those devs (e.g. me). I heard that Scala devs have problems with LLMs writing code too. I am puzzled by good devs not managing to get LLM work for them. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | hawkernews an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
I remember when Anthropic was running their Built with Claude contest on reddit. The submissions were few and let's just say less than impressive. I use Claude Code and am very pro-AI in general, but the deeper you go, the more glaring the limitations become. I could write an essay about it, but I feel like there's no point in this day and age, where floods of slop in fractured echo chambers dominate. | |||||||||||||||||||||||||||||
| ▲ | philipwhiuk 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
It's like CGP Grey hosting a productivity podcast despite his productivity almost certainly going down over time. It's the appearance of productivity, not actual productivity. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | fragmede an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
The crazy pills you are taking is that thinking people have anything to prove to you. The C compiler that Anthropic created or whatever verb your want to use should prove that Claude is capable of doing reasonably complex level of making software. The problem is people have egos, myself included. Not in the inflated sense, but in the "I built a thing a now the Internet is shitting on me and I feel bad" sense. There's fundcli and nitpick on my GitHub that I created using Claude. fundcli looks at your shell history and suggests places to donate to, to support open source software you actually use. Nitpick is a TUI HN client. I've shipped others. The obvious retort is that those two things aren't "real" software; they're not complex, they're not making me any money. In fact, fundcli is costing me piles of money! As much as I can give it! I don't need anyone to tell me that or shit on the stuff I'm building. The "open secret" is that shipping stuff is hard. Who hasn't bought a domain name for a side project that didn't go anywhere. If there's anybody out there, raise your hand! So there's another filtering effect. The crazy pills are thinking that HN is in any way representative of anything about what's going on in our broader society. Those projects are out there, why do you assume you'll be told about it? That someone's going to write an exposé/blog post on themselves about how they had AI build a thing and now they're raking in the dollars and oh, buy my course on learning how to vibecode? The people selling those courses aren't the ones shipping software! | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | rulerviper 8 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
[dead] | |||||||||||||||||||||||||||||
| ▲ | g-mork 2 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||
> it's incredibly obvious that while these tools are surprisingly good at doing repetitive or locally-scoped tasks, they immediately fall apart when faced with the types of things that are actually difficult in software development and require non-trivial amounts of guidance and hand-holding to get things right I used this line for a long time, but you could just as easily say the same thing for a typical engineer. It basically boils down to "Claude likes its tickets to be well thought out". I'm sure there is some size of project where its ability to navigate the codebase starts to break down, but I've fed it sizeable ones and so long as the scope is constrained it generally just works nowadays | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||