Kinda funny but my current feeling about it is different from a lot of people.

I did a lot of AI assisted coding this week and I felt, if anything, it wasn't faster but it led to higher quality.

I would go through discussions about how to do something, it would give me a code sample, I would change it a bit to "make it mine", ask if I got it right, get feedback, etc. Sometimes it would use features of the language or the libraries I didn't know about before so I learned a lot. With all the rubber ducking I thought through things in a lot of depth and asked a lot of specific questions and usually got good answers -- I checked a lot of things against the docs. It would help a lot if it could give me specific links to the docs and also specific links to code in my IDE.

If there is some library that I'm not sure how to use I will load up the source code into a fresh copy of the IDE and start asking questions in that IDE, not the one with my code. Given that it can take a lot of time to dig through code and understand it, having an unreliable oracle can really speed things up. So I don't see it as a way to gets things done quickly, but like pairing with somebody who has very different strengths and weaknesses from me, and like pair programming, you get better quality. This week I walked away with an implementation that I was really happy with and I learned more than if I'd done all the work myself.

▲

andy99 6 days ago | parent | next [-]

> I did a lot of AI assisted coding this week

Are you new to it? There's a pretty standard arc that starts with how great it is and ends with all the "giving up on AI" blog posts you see.

I went through it to. I still use a chatbot as a better stack overflow, but I've stopped actually having AI write any code I use - it's not just the quality, it's the impact on my thinking and understanding that ultimately doesn't improve outcomes over just doing it myself.

▲

PaulHoule 6 days ago | parent | next [-]

I've been doing it for a while. I never really liked stack overflow though it always seemed like a waste of time versus learning how to look up the real answers in the documentation. I never really liked agents because they go off for 20 minutes and come back with complete crap But if I can ask a question get an answer in 20 seconds and iterate again I find that's pretty efficient.

I’ve usually been skeptical about people who get unreasonably good results and not surprised when they wake up a few weeks later and are disappointed. One area where I am consistently disappointed is when there are significant changes across versions: I had one argue over where I could write a switch in Java that catches null (you can in JDK 21) and lots of trouble with SQLAlchemy in Python which changed a lot between versions. I shudder to think what would happen if you asked questions about react-router but actually I shudder to think about react-router at all.

▲

whynotminot 6 days ago | parent | prev | next [-]

When's the last time you "went through the loop" ? I feel like with this stuff I have to update my priors about every three or four months.

I've been using AI regularly since GPT 4 first came out a couple years ago. Over that time, various models from Sonnet to Gemini to 4o have generally been good rubber ducks. Good to talk to and discuss approaches and tradeoffs, and better in general than google + stack overflow + pouring over verbose documentation.

But I couldn't really "hand the models the wheel." They weren't trustworthy enough, easily lost the plot, failed to leverage important context right in front of them in the codebase, etc. You could see that there was potential there, but it felt pretty far away.

Something changed this spring. Gemini 2.5 Pro, Claude 4 models, o3 and o4-mini -- I'm starting to give the models the wheel now. They're good. They understand context. They understand the style of the codebase. And they of course bring the immense knowledge they've always had.

It's eerie to see, and to think about what comes with the next wave of models coming very soon. And if the last time you really gave model-driven programming a go was 6 months or more ago, you probably have no idea what's about to happen.

▲

ryandrake 6 days ago | parent | next [-]

Just one person's opinion: I can't get into the mode of programming where you "chat" with something and have it build the code. By the time I have visualized in my head and articulated into English what I want to build and the data structures and algorithms I need, I might as well just type the code in myself. That's the only value I've found from AI: It's a great autocomplete as you're typing.

To me, programming is a solo activity. "Chatting" with someone or something as I do it is just a distraction.

▲

whynotminot 6 days ago | parent | next [-]

A good part of my career has been spent pair programming in XP-style systems, so chatting away with someone about constraints, what we're trying to do, what we need to implement, etc, might come a bit more naturally to me. I understand your perspective though.

▲

skydhash 6 days ago | parent [-]

That may be one of the reason for the conflict of opinions. I usually build the thing mentally first, then code it, and then verify it. With tools like linters and tests, the shorter feedback make the process faster And editor fluency is a good boost.

By the time, I'm about to prompt, I usually have enough information to just code it away. Coding is like riding a bicycle downhill. You just pay enough focus to ride it. It's like how you don't think about the characters and the words when you're typing. You're mostly thinking about what you want to say.

When there's an issue, I switch from coding to reading and thinking. And while the latter is mentally taxing, it is fast as I don't have to spell it out. And a good helper to that is a repository of information. Bookmarks to docs, documentation browser, code samples,.. By the times the LLM replies with a good enough paragraph, I'm already at the Array page on MDN.

▲

lazyasciiart 6 days ago | parent [-]

Having it write unit tests has been one place it is reliably useful for me. Easily verifiable that it covers everything I’d thought of, but enough typing involved that it is faster than doing it myself - and sometimes it includes one I hadn’t thought of.

▲

skydhash 6 days ago | parent [-]

I’ve read somewhere, IIRC, that you moslty need to test three things: Correct input, incorrect input, and input that are on the fence between the two. By doing some intersection stuff (with the set of parameters, behavior that are library dependent), you mostly have a few things left to test. And the actual process of deciding on which case to test is actually important as that is how you highlight edge cases and incorrect assumptions.

Also writing test cases is how you experience the pain of having things that should not be coupled together. So you can go refactor stuff instead of having to initialize the majority of your software.

	▲	lazyasciiart 6 days ago \| parent [-]
		If you need to hand write tests to think through all that then sure, don’t let an AI do it for you.

▲

nicwolff 5 days ago | parent | prev | next [-]

I'm not chatting with the LLM – I'm giving one LLM in "orchestrator mode" a detailed description of my required change, plus a ton of "memory bank" context about the architecture of the app, the APIs it calls, our coding standards, &c. Then it uses other LLMs in "architect mode" or "ask mode" to break out the task into subtasks and assigns them to still other LLMs in "code mode" and "debug mode".

When they're all done I review the output and either clean it up a little and open a PR, or throw it away and tune my initial prompt and the memory bank and start over. They're just code-generating machines, not real programmers that it's worth iterating with – for one thing, they won't learn anything that way.

▲

tempodox 6 days ago | parent | prev [-]

That's my basic doubt, too. When developing software, I'm translating the details, nuances and complexities of the requirements into executable code. Adding another stage to this process is just one more opportunity for things to get lost in translation. Also, getting an LLM to generate the right code would require something else than the programming languages we know.

▲

andy99 6 days ago | parent | prev | next [-]

Interesting point, I agree that things change so fast that experience from a few months ago is out of date. I'm sceptical there has been a real step change (especially based on the snippets I see claude 4 writing in answer to questions) but it never hurts to try again.

My most recent stab at this was Claude code with 3.7, circa March this year.

To be fair though, a big part of the issue for me is that having not done the work or properly thought through how a project is structured and how the code works, it comes back to bite later. A better model doesn't change this.

	▲	theptip 6 days ago \| parent \| next [-]
		There has been a big change with Claude 4.0 in my opinion. Probably depends on your environment, but it’s the first time I’ve been able to get hundreds of lines of Python that just works when vibe coding a new project. It’s still slower going as the codebase increases in size, and this is my hypothesis for the huge variance; I was getting giddy at how fast I blew through the first 5 hours of a small project (perhaps in 30 mins with Claude) but quickly lost velocity when I started implementing tests and editing existing code.
	▲	whynotminot 6 days ago \| parent \| prev [-]
		If you give it another try, my goto right now is Sonnet 4 Thinking. There's a pretty massive difference in intelligence by switching from just plain 4 to 4 Thinking. It's still pretty fast, and I think hits the right balance between speed and useful intelligence. However, at least in my experience, nothing beats o3 for raw intelligence. It's a little too slow to use as the daily driver though. It's kind of fun seeing the models all have their various pros and cons. > To be fair though, a big part of the issue for me is that having not done the work or properly thought through how a project is structured and how the code works, it comes back to bite later. A better model doesn't change this. Yes, even as I start to leverage the tools more, I try to double down on my own understanding of the problem being solved, at least at a high level. Need to make sure you don't lose the plot yourself.

▲

otabdeveloper4 6 days ago | parent | prev [-]

> just try a newer model bro

Fundamentally none of the issues inherent in LLMs will be fixed by increasing parameter count or better weights.

Shilling for the newest model is a dead end, better to figure out how to best put LLMs to use despite their limitations.

▲

whynotminot 6 days ago | parent [-]

> Fundamentally none of the issues inherent in LLMs will be fixed by increasing parameter count or better weights.

Um, ok just disregard my entire post where I talk about how my issues with using LLMs for programming were literally solved by better models.

▲

otabdeveloper4 5 days ago | parent [-]

They weren't. And if you think they were, you're simply wrong.

Sorry but it is what it is.

	▲	whynotminot 4 days ago \| parent [-]
		Do you get very far at work with contrarian takes backed up without evidence or detail? How does dismissing other people’s experience so condescendingly work out for you in real life? I’m curious if/how you’ve built a career this way. Or is this account just a stress relief “shit-posting” persona?

▲

benreesman 5 days ago | parent | prev | next [-]

I'm hanging in there with it. The article was remarkably dispassionate about something that gets everyone all hot around here: easy stuff benefits from it a lot more than hard stuff. If your whole world is Node.js? Yeah, not a lot of deep water energy discovery geology data mining at exabyte scale or whatever. It pushed the barrier to entry on making a website with a Supabase backend from whatever it was to nearly zero. And I want to be really clear that frontend work is some of my favorite work: I love doing the web interface, it's visual and interactive and there's an immediacy that's addictive, the compiler toolchains are super cool, it's great stuff. I worked on a web browser, got some patents for it. I like this stuff.

But getting extreme concurrency outcomes or exotic hardware outcomes or low latency stuff or heavy numerics or a million other things? It's harder, the bots aren't as good. This is the divide: it can one shot a web page, even the older ones can just smoke a tailwind theme, headshot. A kernel patch to speed up a path on my bare metal box in a latency contest? Nah, not really.

But I see a lot of promise in the technology even if the sort of hype narrative around it seems pretty much false at this point: I still use the LLMs a lot. Part of that is that search just doesn't work anymore (although... Yandex got a lot better recently out of the blue of all things), and part of it is that I see enough exciting glimpses of like, if I got it hooked up to the right programming language and loaded the context right, wow, once in a while it just slams and it's kinda rare but frequent enough that I'm really interested in figuring out how to reproduce it reliably. And I think I'm getting better outcomes a little bit at a time, getting it dialed in. Two or three months ago an even with claude code would have me yelling at the monitor, now it's like, haha, i see you.

▲

resonious 6 days ago | parent | prev | next [-]

I've been back and forth, and currently heavily relying on AI-written code. It all depends on knowing what the AI can and can't do ahead of time. And what it can do often overlaps with grunt work that I don't enjoy doing.

▲

positron26 6 days ago | parent | prev [-]

> There's a pretty standard arc that starts with how great it is and ends with all the "giving up on AI" blog posts you see.

Wouldn't be shocked if this is related to getting ramped up.

Switch to a language you don't know. How will that affect your AI usage? Will it go up or down over time?

Had similar experiences during the SO early days or when diving into open source projects. Suddenly you go from being stuck in your bubble to learning all kinds of things. Your code gets good. The returns diminish and you no longer curiously read unless the library is doing something you can't imagine.

▲

gerdesj 6 days ago | parent | prev | next [-]

"I would go through discussions about how to do something"

Have you compared that to your normal debugging thought processes? I get that you might be given another way to think about the problem but another human might be best for that, rather than a next token guesser.

I have a devil of a time with my team and wider, the younger ones mainly, getting them to pick up a phone instead of sending emails or chats or whatever. A voice chat can solve a problem within minutes or even seconds instead of the rather childish game of email ping pong. I do it myself too (email etc) and I even encourage it, despite what I said earlier - effective use of comms is a skill but you do need to understand when to use each variety.

▲

furyofantares 6 days ago | parent | prev | next [-]

This is great, it's so easy to get into the "go fast" mode that this potential gets overlooked a lot.

▲

dbtc 6 days ago | parent | prev | next [-]

> It would help a lot if it could give me specific links to the docs

Just a super quick test: "what are 3 obscure but useful features in python functools. Link to doc for each."

GPT 4o gave good links with each example.

(its choices were functools.singledispatch, functools.total_ordering, functools.cached_property)

Not sure about local code links.

	▲	steveklabnik 6 days ago \| parent [-]
		I've had this return great results, and I've also had this return hallucinated ones. This is one area where MCPs might actually be useful, https://context7.com/ being one of them. I haven't given it enough of a shot yet, though.

▲

x86x87 6 days ago | parent | prev | next [-]

viewing it as an assistant is the way to go. it's there to help you - like an overpowered autocomplete - but not there to think for you.

▲

Shorel 6 days ago | parent | prev [-]

Some people copy and paste snippets of code without knowing what it does, and in a sense, they spread technical debt around.

LLMs lower the technical debt spread by the clueless, to a lower baseline.

You were part of the clueless, so an LLM improved your code and lowered the technical debt you would have spread.