| ▲ | Incipient 4 hours ago | |
I have spent a SOLID 3 full days 8h/day (plus long running tasks overnight) thrashing out a random idea for a Web application using purely Opus (mostly Max, sometimes ultracode version). I'm not a project manager, but I genuinely tried a full 3-tier spec out - design->specs->build details. While it was significantly better than previous attempts, it still misses very basic things - sporadically. Eg. A clear design requirement was essentially adding clients, explained clearly and comprehensively. The ability to add clients was entirely missed in the build and iteration (there were multiple 'please check its all done' separate agent runs/checks). I can imagine in a fully autonomous deployment, in even moderate complexity, even to this day would still occasionally mess up - badly enough to cause non-trivial business issues. I haven't managed to really figure out what's the best way, but my latest thinking is really having boil down tasks to almost unit operations "add UI button, wire to Api call. End". | ||
| ▲ | A_D_E_P_T 13 minutes ago | parent | next [-] | |
> I haven't managed to really figure out what's the best way For you, the best way is to break your code down into modules insofar as possible, so that you don't overrun the context window. Opus Max starts forgetting things the minute it begins compressing your conversation -- and multiple compressions can make for gaps in memory. I find that it's also important to have another model serve as review/critique. I use Opus Max for code and 5.5 Pro for immediate code review. The latter will often pick up on things that might have been missed, and will usually provide good suggestions. | ||
| ▲ | KronisLV 3 hours ago | parent | prev [-] | |
> there were multiple 'please check its all done' separate agent runs/checks You could ask it to go through the spec point by point and then mark what is done and WHERE/WHY, then it'd point you towards exactly what might be missing. | ||