Remix.run Logo
fragmede 12 hours ago

I mean, what is the validation system? Either it exists in code, and thus can be discovered if you point the AI at repo, or... what, it doesn't exist?

For the UX, have it explore your existing repos and copy prior art from there and industry standards to come up with something workable.

Web scale issues can be inferred by the rest of the codebase. If your terraform repo has one RDS server, vs a fleet of them, multi-region, then the AI, just as well as a human, can figure out if it needs Google Spanner level engineering or not. (probably not)

Bigger picture though, what's the process of a human logs an under specified ticket and someone else picks it up and has no clue what to do with it? They're gonna go ask the person who logged the bug for their thoughts and some details beyond "hurr Durr something something validation". If we're at the point where AI is able to make a public blog post shaming the open source developer for not accepting a patch, throwing questions back to you in JIRA about the details of the streaming validation system is well within its capabilities, given the right set of tools.

gordonhart 12 hours ago | parent [-]

Honestly curious, have you seen agents succeed at this sort of long-trajectory wide breadth task, or is it theoretical? Because I haven't seen them come close (and not for lack of trying)

codegangsta 12 hours ago | parent | next [-]

Yeah I absolutely see it every day. I think it’s useful to separate the research/planning phase from the building/validadation/review phase.

Ticket trackers are perfect for this. Just start with asking AI to take this unclear, ambiguous ticket and come up with a real plan for how to accomplish it. Review the plan, update your ticket system with the plan, have coworkers review it if you want.

Then when ready, kick off a session for that first phase, first PR, or the whole thing if you want.

kolinko 10 hours ago | parent | prev | next [-]

In my expedience, Claude Code with opus 4.5 is the first one to tackle such issues well.

fragmede 10 hours ago | parent | prev [-]

Opus 4.6, with all of the random tweaks I've picked up off of here, and twitter, is in the middle of rewriting my golang cli program for programmers into a swiftui Mac app that people can use, and it's totally managing to do it. Claude swarm mode with beads is OP.