That's because you're treating the problem as an engineer instead of an "influencer" or "10xer" or whatever. You're treating it as a problem to be solved with engineering and AI is merely a tool to do so. It is, in my experience, vanishingly rare for an engineer to have a problem that needs to be solved with multiple hours of unattended AI code generation.

I've only found one single application where it makes even the slightest amount of sense to have an AI grind away for hours on end. I'm reverse engineering a widget which contains five separate firmware images. I've dumped the binary from the widget and I set the AI to decompile and reverse engineer these interrelated firmware projects. It's a compelx task, but very well bounded. It's not complicated work, but it's a lot of work, and the end result is a C-shaped pile of text that is only informative, it never would be compilable on its own even if I did it by hand. The quality of the output is tightly bounded by the input assembly and the overall output artifact is documentation in the shape of code.

I don't have any qualms about letting an AI go ham on it unattended because the stakes are zero. But if the AI can beat the assembly into a recognizable C project, it's much easier for me to read and reason about. Easy win, I think.

▲

rbalicki 3 hours ago | parent [-]

I'll add another use case for letting an AI go ham: many small, atomic refactors where the name of the game is never breaking anything.

My personal OSS projects don't have the scale to necessarily make this worth it, but at work I run three pipelines using Barnum (https://barnum-circus.github.io/). First, one that ingests files, identifies refactors (from a pre-approved list), and places a precise description of the refactor to be done in a queue; second, one that reads from said queue, implements and creates PRs (there is a lot of "check that the PR is correct" here as well); and a third that babysits PRs until they land. I've landed hundreds of PRs in this way, with very little effort on my part.

▲

frizlab 3 hours ago | parent [-]

I recently in $COMPANY had a coworker try fable to do a refactor where not breaking anything was the game.

It broke something at the first PR.

I think we’re not there yet.

▲

sunrunner 2 hours ago | parent [-]

I've found that adding "Make no mistakes." to my prompt usually helps with this kind of problem...

▲

cubano an hour ago | parent | next [-]

perhaps simply threatening to fire it would also do the trick...it sure has worked well on us for a long time now.

▲

lemming an hour ago | parent | prev | next [-]

Or if the code is really important, sometimes even “please make no mistakes” is necessary.

▲

dozerly 2 hours ago | parent | prev [-]

We are so many layers deep in AI hype that I honestly can’t tell if this is /s or not

	▲	cadamsdotcom a minute ago \| parent [-]
		Plug it in to ChatGPT and ask for a summary and an opinion on whether it’s sarcasm, you’ll get something 10 times as long containing no information at all. Truly the future!