Remix.run Logo
noelwelsh 11 hours ago

I wish people would describe in more detail the tasks they use LLMs to code. My experience is that simple components in an existing architecture are fine, but anything requiring architectural considerations quickly becomes a mess. On my projects (e.g. a ui framework), running multiple agents in parallel would just increase the speed at which it can stuff up the project.

germanptr 9 hours ago | parent | next [-]

I get this question a lot, and I found it hard to answer briefly, so I ended up writing a longer post about how I work:

https://www.trigosec.com/insights/mob-programming-for-one/

The short version is that I don’t let AI agents work unsupervised on my code. I treat them like participants in a mob programming session instead of autonomous developers. Different agents get different roles (implementer, reviewer, architect, security reviewer, etc.), and I stay involved throughout the process.

I also agree with your point about architecture. Generating isolated components is relatively easy; preserving and evolving the architectural boundaries across a larger codebase is much harder.

We’re still missing a good way to express and measure architectural quality. Until then, architecture heavy work requires much closer supervision than implementation heavy work

Swizec 2 hours ago | parent | next [-]

> We’re still missing a good way to express and measure architectural quality

Architectural complexity[1]! There’s several really good papers on this.

Unfortunately it never caught on and we don’t have great automated tools to spit out a number. Also the majority of people just don’t care enough. Research in this field kinda died out when we invented microservices and started treating those as a silver bullet to The Architecture Problem (it’s not [2])

[1] https://swizec.com/blog/why-taming-architectural-complexity-...

[2] https://youtu.be/y8OnoxKotPQ

fbrchps 25 minutes ago | parent [-]

Didn't even need to click the YouTube link, I knew it would be Krazaam.

vslira 2 hours ago | parent | prev [-]

> The short version is that I don’t let AI agents work unsupervised on my code. I treat them like participants in a mob programming session instead of autonomous developers.

I wonder if OS maintainers would have a leg up in defining workflows to better leverage this. Of course, OS contributors are autonomous developers, but maybe a trick or two might transfer across

davidcann 9 hours ago | parent | prev | next [-]

I built this with 94% written by coding agents: https://buildermark.dev/

The complete log of all prompts and commits is here: https://demo.buildermark.dev/projects/u020uhEFtuWwPei6z6nbN

MonstraG 8 hours ago | parent [-]

It seems that pages 2-5 on

https://demo.buildermark.dev/projects/u020uhEFtuWwPei6z6nbN/...

still show content of page 1

davidcann 8 hours ago | parent [-]

Thanks for the report. I messed up the CDN settings. It looks fixed now.

properbrew 10 hours ago | parent | prev | next [-]

I used LLMs to develop Whistle Enterprise (https://whistle-enterprise.com) from the ground up, from scratch.

It's taken _a lot_ of time and effort, but this is an example of what can be developed using LLMs alone.

You have to have dedication and a goal to reach, but you can absolutely build anything if you're building with the right foundations in mind.

ryanackley 8 hours ago | parent | next [-]

I think the relevant question isn’t what can be built but the amount of effort in comparison to doing this the old fashioned way.

What do you think the productivity gain was from using an LLM? This question assumes you’re already an experienced developer.

motoroco 4 hours ago | parent | next [-]

There’s no free lunch, it takes time and effort still. And expertise if you need it to be robust.

In terms of velocity, let me offer some numbers. In 6 months I generated >150k lines of code and merged 10k PRs to ship and iterate on https://plotalong.app

I follow best practices and isolate agents to continuously deployed dev environments, semi-manually review PRs and gate the release process between multiple protected envs. The project is getting close to 500 end-to-end tests in Playwright.

That’s just working nights and weekends. Before AI, it took my team at the office 4 years to produce this much work. There are some qualitative differences but the speed and results are real

andai 6 hours ago | parent | prev [-]

n=1 but, a friend of mine spent the last few months working on an experimental music software with Claude. What he built is amazing and far beyond my abilities (I have been programming for 20 years). He doesn't know any programming.

In fact, it's far beyond what I would even attempt, because I've just spent two decades building up a data bank of how hard things are supposed to be.

He doesn't know it's supposed to be hard, so he just does it.

leguy 2 hours ago | parent | prev [-]

neat. I saw the "no bot joins the call". Is it obvious to others in the virtual meeting that you are using this tool?

amelius 2 hours ago | parent | prev | next [-]

I personally limit LLMs to single files only at the moment. Self-contained components.

Using LLMs in a larger scope can sometimes work, but it has the real risk of turning a project into a mess after which you will have to undo the work and lose a lot of time.

Also, using LLMs this way with less clear boundaries will make reading and maintaining the code more cumbersome.

pjmlp 3 hours ago | parent | prev | next [-]

Me when not trying to meet management expectations, only as smarter code completion, formatting code, basic code analysis, and helping copy pasting code examples between languages.

Me when meeting management expectations, agent orchestration tools like Boomi and Workato calling into tools, doing with AI what a few years ago would be done with BPEL.

TheBigSalad 4 hours ago | parent | prev | next [-]

You have to make those architectural decisions and feed them to the agents. Be very specific. That's been my experience.

pipes 4 hours ago | parent | prev | next [-]

I found that this guys stuff has really helped me:

https://youtu.be/-QFHIoCo-Ko?is=FYYdukWluYX3vdQL

Worth a watch.

nullbio 10 hours ago | parent | prev | next [-]

It's great for people who are just maintaining something. Less so for someone building something from scratch, in the earlier phases.

Npovview 10 hours ago | parent | prev [-]

There are hour long youtube videos where people explain the process by using a complex toy project. Search for them.