Remix.run Logo
cmiles8 3 days ago

The story that’s solidifying is the tech is cool, it’s useful for certain things (eg, meeting note taking), but business have run a ton of “innovation lab” pilots that have returned little to no measurable value with leaders getting frustrated at the invested red ink. In short the substance isn't living up to the hype.

Everywhere I look the adoption metrics and impact metrics are a tiny fraction of what was projected/expected. Yes tech keynotes have their shiny examples of “success” but the data at scale tells a very different story and that’s increasingly hard to brush under the carpet.

Given the amount of financial engineering shenanigans and circular financing it’s unclear how much longer the present bonanza can continue before the financial and business reality playing out slams on the brakes.

jp8585 3 days ago | parent | next [-]

I actually think things improved substantially when compared to last year. The latest batch of sota models is incredible (just ask any software engineer about what’s happening to their profession). It’s only a matter of time until other knowledge workers start getting the asphyxiating “vibe” coding treatment and that drama is what really fascinates me.

People are absolutely torn. It seems that ai usage starts as a clutch, then it becomes an essential tool and finally it takes over the essence of the profession itself. Not using it feels like a waste of time. There’s a sense of dread that comes from realizing that it’s not useful to “do work” anymore. That in order to thrive now, we need to outsource as much of your thinking to GPT as possible. If your sense of identity comes from “pure” intellectual pursuits, you are gonna have a bad time. The optimists will say “you will be able to do 10x the amount of work”. That might be true, but the nature of the work will be completely different. Managing a farm is not the same as planting a seed.

Terretta 3 days ago | parent | next [-]

There’s a sense of dread that comes from realizing that it’s not useful to “do work” anymore. That in order to thrive now, we need to outsource as much of your thinking to GPT as possible. If your sense of identity comes from “pure” intellectual pursuits, you are gonna have a bad time.

This is 180 degrees from how to think about it.

The more thinking you do as ratio to less toil, the better. The more time to apply your intellect with the better machine execution to back that up, the more profit.

The Renaissance grand masters used ateliers of apprentices and journeymen while the grand masters conceived, directed, critiqued, and integrated their work into commissioned art; at the end signing their name: https://smarthistory.org/workshop-italian-renaissance-art/

This is how to leverage the machine. It's your own atelier in a box. Go be Leonardo.

jp8585 3 days ago | parent | next [-]

I definitely understand that this is the rational way of viewing it. Leveraging these tools is an incredible feeling, but the sense of dread is always there in the corner. You can just feel a deep sense of angst in a lot of these interviews. In any case, I would rather have them and use them to their full extent than to become obsolete. Becoming Leonardo it is.

jonplackett 2 days ago | parent [-]

If you are capable of being a leonardo, then this approach will work.

Not everyone is capable of being Leonardo

jp8585 2 days ago | parent [-]

I know, right? That’s part of the angst these professionals suffer. Failure, despite having the infinite leverage provided by these tools.

wongarsu 3 days ago | parent | prev | next [-]

The catch is that many professional environments have evolved values that above a certain quality floor reward quantity over quality. Even more so in the US where pointless torment is "work ethic" and pausing to think something through is "lazy" (see Bill Gate's famous quote about hiring lazy people, or "work smarter, not harder" almost being a rebel motto).

Granted, that's not everywhere. There are absolutely places where you will be recognized for doing amazing work. But I think many feel pressured to use AI to produce high volumes of sub-par work instead of small volumes of great work

godelski 2 days ago | parent [-]

  > see Bill Gate's famous quote about hiring lazy people
I think this is part of why all this is so contentious. There's been a huge culture shift over the last decade and AI is really just a catalyst to it. We went from managers needing to stop engineers from using too much abstraction and optimizing what doesn't need to be optimized to the engineers themselves attacking abstraction. Just look how people turn Knuth's "premature optimization is the root of evil" went from "get a profiler before you optimize" to "optimization? Are you crazy?"

Fewer and fewer people I know are actually passionate about programming and it's not uncommon to see people be burned out and just want to do their 9-5. And I see a strong correlation with these people embracing AI. It makes sense if you don't care and are just trying to get the job done. I don't think it's surprising things are getting buggier and innovation slowed. We killed the passion and tried to turn it into a mechanical endeavor. It's a negative feedback loop

nl 2 days ago | parent [-]

I've been programming professionally since the 1990s and our software has never been less buggy.

When was the last time you rebooted your OS, or even restarted your browser?

Software has never been as high quality as it is now.

godelski 2 days ago | parent [-]

I think you're looking at different timeframes and different types of bugs.

The last 5 years have been a drastic change for me. For over a decade I've used a Linux desktop and a laptop with various OSes. But in the last 5 years we went from my Linux desktop (Arch of all things!) from having "typical Linux issues" to my macbook having 10x more (and my arch machine being incredibly stable).

Yes, big picture stuff everything is stable and general purpose computers have blurred the line with servers.

BUT there's also a lot of frustrating day to day bugs that did not exist even a year ago. This Apple keyboard bug [0]? Infuriating! I personally hit a similar issue multiple times a day where, despite auto correct being disabled, it will change the word previous to the one I'm typing, even though it was already correct. Worse, pressing backspace delete two words. I'm with this user from years ago... "Am I getting older or is it becoming unbearable?"[1]

Many of the big picture things? Fine. But that doesn't mean I'm not being killed by a million little cuts. That's the problem. They're everywhere and when you complain about any single instance it is easy to brush off. But it isn't a single instance. It is eating hours of my day in 5 sec intervals. That's "buggy as shit"

[0] https://news.ycombinator.com/item?id=46232528

[1] https://news.ycombinator.com/item?id=33256168

polo 2 days ago | parent | prev | next [-]

“It's your own atelier in a box. Go be Leonardo.”

So well put. 100% agree. Paraphrasing Steve Jobs I think of it as a mech suit for the mind.

pertymcpert 2 days ago | parent | prev [-]

I don't necessarily agree with you completely, but I think that's a really great analogy. At the very least full of optimism.

zdragnar 2 days ago | parent [-]

It's a fundamentally flawed analogy. Leo's apprentices learned and improved. They studied under a master and faced serious repercussions if they bullshitted about their ability or what they had accompolished.

LLM capabilities are tied to their model, and won't improve on their own. You learn the quirks of prompting them, but they have fixed levels of skill. They don't lie, because they don't understand concepts such as truth or deception, but that means they'll spout bullshit and it's up to you to review everything with a skeptical eye.

In this analogy, you aren't the master, you're one part client demanding work, one part the janitor cleaning up after their mistakes.

Terretta 21 hours ago | parent | next [-]

All my LLM assistants are in night school learning and improving. At least I assume, since they're on such an astonishing pace of improvement.

solumunus 2 days ago | parent | prev [-]

> one part the janitor cleaning up after their mistakes.

More often their master simply pointing out what they did wrong and instructing them to fix or improve it.

vkou 2 days ago | parent | prev | next [-]

I'm a professional developer, using SOTA systems, and dealing with them is like bargaining with a fucking empathy vampire. It is emotionally draining.

They trick the reptilian part of your brain that you're dealing with something resembling a human being, but if they were one, they'd be described as a pathological liar and gaslighter. You can't go off on them for it, because they don't give a shit, and you shouldn't go off on them for it, because making a habit of that will make you a spiteful, unpleasant piece of shit for your coworkers to be around.

It's one thing when a machine or a tool doesn't function in the way you intend it to. It's another when this soulless, shameless homunculus does.

jbs789 2 days ago | parent [-]

That’s an interesting point. I do get pretty tired of the “you are right!” I get the upsides to engagement for a chat bot but for real work it is quite draining.

delusional 3 days ago | parent | prev [-]

> just ask any software engineer about what’s happening to their profession

I'm a professional developer, and nothing interesting is happening to the field. The people doing AI coding were already the weakest participants, and have not gained anything from it, except maybe optics.

The thing that's suffocating is the economics. The entire economy has turned its back on actual value in pursuit of silicon valley smoke.

latentsea 3 days ago | parent | next [-]

Nothing interesting happening in the field? If you've been paying attention the trend over the last two years has been that the problem space that requires humans to solve has been shrinking. It's continuing to shrink. That's interesting. Significantly interesting.

As an engineer that's lead multiple teams including one at a world leading SaaS company, I don't consider myself one of the weakest participants in the field and neither do my peers generally. I'm long on agents for coding, and have started investing heavily in making our working environment productive not only for humans, but now for agents too.

jondwillis 3 days ago | parent | next [-]

So what does that amount to? Shared Claude code hooks and skills?

latentsea 2 days ago | parent [-]

Things like that are only part of it. You can also also up your agents batting average by finding ways to build guardrails and things that inject the right context at the right time.

Like for instance we have a task runner in our project that provides a central point to do all manner of things like linting, building, testing, local deployment etc. The build, lint and test tasks are shared between local development and CI. The test tasks run the tests, take the TRX files and use a library to parse it to produce a report. So the agent can easily get access to the same info as CI is putting out about test failures. The various different test suites output reports under a consistent folder structure, they also write logs to disk under a consistent folder structure too. On failure the test tasks output a message to look at the detailed test reports and cross-reference that with the logs to debug the issue. Where possible the test reports contain correlation IDs inlined into the report.

With the above system when the agent is working through implementing something and the tests don't pass, it naturally winds up inspecting the test reports, cross referencing that with the logs, and solving the problems at a higher rate than compared to just taking a wild guess at how to run the tests and then do something random.

Getting it to write it's own guardrails by creating Roslyn Analyzers to make the build fail when it deviates from the project architecture and conventions has been another big win.

Tonnes of small things like that start to add up.

Next on my list is getting a debug MCP server, so it can set breakpoints and step through code etc.

qotgalaxy 2 days ago | parent | prev [-]

[dead]

jp8585 3 days ago | parent | prev | next [-]

That’s fascinating. If you don’t mind me asking, what type of software development do you do? Have you tried any of the latest coding tools? Or even used LLMs as a replacement for stack overflow?

delusional 2 days ago | parent [-]

Professionally, I do banking. It's a lot of integration work, sprinkled with a little algorithm every now and then. Lately I've been on capital requirements. The core of that is a system called AxiomSL, which is quite a lot of work for one guy to keep running.

In my spare time I write some algorithmic C, you can check that stuff out on github (https://github.com/DelusionalLogic) if you're curious.

I was an early adoter of LLM's. I used to lurk in the old EleutherAI discord and monitor their progress in reconstructing GPT-2 (I recall it being called GPT-J). I also played around a bunch with image generation. At this point nobody really tried applying them to code. We were just fascinated that it wrote back at all.

I have tried most of the modern models for development. I find then to generate a lot of nonsensical and unexplainable code. I've had no success (in the 30 or so times I've tried) at getting any of the models to debug or develop even small features. They usually get lost in some "best practice" and start looping on that forever. They're also constantly breaking style and violating module boundaries.

If i use them to generate documentation I find it to be surface level and repetitive. It'll make a lot of text about structures that are obvious to me just glancing at the code, but will (obviously) not have any context about the thought process that created that code, which is the only part I care about. I can read the code just fine myself. This is the same problem I find in commit messages generated with AI tools.

For the reversing I also do, I find the models to be too imprecise. It'll take large logical leaps that ruin understanding of the code I'm trying to understand. This is the only place I actually believe a properly trained (not a chatbot) model could actually succeed past the state of the art.

I don't really use stackoverflow either, I don't trust its accuracy, and it's easy to get cargo culted in software. I generally try to find my answers in official documentation, and if I can't get that I'll read the source code. If that's unavailable I'll take a guess, or reverse the thing If it's really important to me.

frizlab 3 days ago | parent | prev | next [-]

I would love to be able to say the same but I’m literally the only last person in the company still not using AI to code (if anything for ethics reasons, but I also truly do not need it at all), and I am obviously not the only good dev in the company. The gain is highly debatable (especially in delivery, I do not trust self-reports), however there have been recent reports of morale improvement since using AI, so at least there’s that.

nl 2 days ago | parent | prev [-]

I'm a decent dev and I'm possibly 100 times as productive using AI.

It lets me concentrate on the parts I'm good at and ignore things I don't care about. Claude is a lot better at React than I am and I don't care.

delusional 2 days ago | parent [-]

100 times? You do in a day what used to take you 3 months?

Those are just not realistic numbers.

nl 2 days ago | parent [-]

Yes.

In the last 2 months I’ve built 3 end-to-end micro-SAAS apps.

It would easily take me a year to do that in the past because I hated some parts so much I'd just avoid working on them.

Now it's literally "bite down for and evening and it's done" whereas it could be 6 months or more before.

If anything 100 times is underestimating how much more I'm doing.

doug_durham 3 days ago | parent | prev | next [-]

That's not what the data shows. Read the posting, and read Anthropic's original report. I found it a very sober, grounded report on the reality of using today's tools.

blindhippo 3 days ago | parent | prev [-]

If anything, the AI bubble is reinforcing to me (and hopefully many more people) that the "markets" are anything but rational. None of the investments going on have followed any semblance of fundamentals - it's all pure instinct and chasing hype. I just hope it doesn't tear down the world for the 99% of us unable to actually reap any benefits from it.

AI is basically a toy for 99% of us. It's a long long ways away from the productivity boost people love to claim to justify the sky high valuations. It will fade to being a background tech employed strategically I suspect - similar to other machine learning applications and this is exactly where it belongs.

I'm forced to use it (literally, AI usage is now used as a talent review metric...) and frankly, it's maybe helped speed me up... 5-10%? I spend more time trying to get the tools to be useful than I would just doing the task myself. The only true benefit I've gotten has been unit test generation. Ask it to do any meaningful work on a mature code base and you're in for a wild ride. So there's my anecdotal "sentiment".

dionian 3 days ago | parent | next [-]

I multi task much more now that i can farm off small coding assignments to agents. i pay hndreds per month in tokens. for my role personally its been a massive paradigm shift.

blindhippo 3 days ago | parent | next [-]

Might work for you, but if I multi task too much, the quality of my output drops significantly. Where I work, that does not fly. I cannot trust any agent to handle anything without babysitting them to avoid going off the rails - but perhaps the tools I have access to just aren't good (underlying model is claude 4.5, so it the model isn't the cause).

I've said this in the past and I'll continue to say it - until the tools get far better at managing context, they will be hard locked for value in most use cases. The moment I see "summarizing conversation" I know I'm about to waste 20 minutes fixing code.

dionian 3 days ago | parent | next [-]

I think it depends on the project and the context, but I developed my own task management system particularly because of this challenge. I'm starting to extend this with verification gates as well.

If I worked on different types of systems with different types of tasks I might feel the same way as you, i think AI works well in specific targeted use cases, where some amount of hallucination can be tolerated and addressed.

What models are you using, I use opus 4.5, which can one shot a surprising ratio of tasks.

fragmede 3 days ago | parent | prev [-]

If you can predict that hitting “summarize conversation” equals rework, what can you change upstream so you avoid triggering it? Are you relying on the agent to carry state instead of dumping it into .MD files? What happens if your computer crashes?

> so it the model isn't the cause

Thing is, the prompts, those stupid little bits of English that can't possiu matter all that much? It turns out they affect the models performance a ton.

cmiles8 3 days ago | parent | prev | next [-]

There are absolutely folks like you out there and I don’t doubt the productivity increase. The challenge is you are not the norm and the hundreds per month from you and others like you are a drop in the bucket of what’s needed to pay for all this.

WhyOhWhyQ 3 days ago | parent | prev [-]

To each his own, but multi-tasking feels bad to me. I want to spend my life pursuing mastery of a craft, not lazily delegating. Not that everyone should have the same goals, but the mastery route feels like it's dying off. It makes me sad.

I get it that some people just want to see the thing on the screen. Or your priority is to be a high status person with a loving family etc.. etc... All noble goals. I just don't feel a sense of fulfillment from a life not in pursuit of something deeper. The AI can do it better than me, but I don't really care at the end of the day. Maybe super-corp wants the AI to do it then, but it's a shame.

dionian 3 days ago | parent | next [-]

I lazily delegate things that can be automated, which frees me up to do actual feature development.

Terretta 3 days ago | parent | prev [-]

> I want to spend my life pursuing mastery of a craft, not lazily delegating.

And yet, the Renaissance "grand masters" became known as masters through systematizing delegation:

https://smarthistory.org/workshop-italian-renaissance-art/

WhyOhWhyQ 3 days ago | parent | next [-]

I have wondered about that actually. Thanks, I'll read that, looks interesting.

Surely Donald Knuth and John Carmack are genuine masters though? There's the Elon Musk theory of mastery where everyone says you're great, but you hire a guy to do it, and there's the <nobody knows this guy but he's having a blast and is really good> theory where you make average income but live a life fulfilled. On my deathbed I want to be the second. (Sorry this is getting off topic.)

fragmede 3 days ago | parent [-]

Masters of what though?

Steve Jobs wrote code early on, but he was never a great programmer. That didn’t diminish his impact at all. Same with plenty of people we label as "masters" in hindsight. The mastery isn’t always in the craft itself.

What actually seems risky is anchoring your identity to being the best at a specific thing in a specific era. If you're the town’s horse whisperer, life is great right up until cars show up. Then what? If your value is "I'm the horse guy," you're toast. If your value is taste, judgment, curiosity, or building good things with other people, you adapt.

So I’m not convinced mastery is about skill depth alone. It's about what survives the tool shift.

WhyOhWhyQ 3 days ago | parent | next [-]

I won't insult the man, but I never liked Steve Jobs. I'd rather be Wozniak in that story.

"taste, judgment, curiosity, or building good things with other people"

Taste is susceptible to turning into a vibes / popularity thing. I think success is mostly about (firstly just doing the basics like going to work on time and not being a dick), then ego, personality, presentation, etc... These things seem like unfulfilling preoccupations, not that I'm not susceptible to them like anyone else, so in my best life I wouldn't be so concerned about "success". I just want to master a craft and be satisfied in that pursuit.

I'd love to build good things with other people, but for whatever reason I've never found other people to build things with. So maybe I suck, that's a possibility. I think all I can do is settle on being the horse guy.

(I'm also not incurious about AI. I use AI to learn things. I just don't want to give everything away and become only a delegator.)

Edit: I'm genuinely terrified that AI is going to do ALL of the things, so there's not going to be a "survives the shift" except for having a likable / respectable / fearsome personality

re-thc 3 days ago | parent | prev [-]

> Steve Jobs wrote code early on, but he was never a great programmer. That didn’t diminish his impact at all.

I doubt Jobs would classify himself as a great programmer, so point being?

> So I’m not convinced mastery is about skill depth alone. It's about what survives the tool shift.

That's like saying karate masters should drop the training and just focus on the gun? It does lose meaning.

brazukadev 2 days ago | parent | prev | next [-]

It seems you are a bit obsessed with the Renaissance? Are you building a "vibeart" platform?

alehlopeh 3 days ago | parent | prev [-]

I like how you compare people to renaissance painters to inflate their egos

fragmede 3 days ago | parent | next [-]

The other surprising skill from this whole AI craze is, it turns out that being able to social engineer an LLM is a transferable skill to getting humans to do what you want.

brazukadev 2 days ago | parent [-]

One of the funniest things to see nowadays is the opposite tho, some people expecting similar responses from people but getting thrashed as we are not LLMs programmed to make them feel good

WhyOhWhyQ 3 days ago | parent | prev [-]

Inflate whose ego? Mine? It seemed more like a swipe than ego-inflation, but I was happy to see the article anyway.

re-thc 3 days ago | parent | prev | next [-]

> the "markets" are anything but rational

No, they are rational. At least those with a lot of money.

> None of the investments going on have followed any semblance of fundamentals - it's all pure instinct and chasing hype

That's not what investments are about. Their fundamentals are if they can get a good return on their money. As long as the odds of the next sucker to buy them up exists it is a good investment.

> AI is basically a toy for 99% of us.

You do pay for toys right? Toy shops aren't irrational?

fragmede 3 days ago | parent | prev [-]

> AI is basically a toy for 99% of us.

So you're at the "first they laugh at us" stage then.

AnimalMuppet 3 days ago | parent [-]

OK, but not everything that gets to that stage moves on to the next, let alone the stage after that.

But I will give you this, the "first they ignore us" stage is over, at least for many people.