Remix.run Logo
lysace 11 hours ago

I'm already burning through enough tokens and producing more code than can be maintained - with just one claude worker. Feel like I need to move into the other direction, more personal hands-on "management".

AffableSpatula 11 hours ago | parent | next [-]

I've seen more efficient use of tokens by using delegation. Unless you continually compact or summarise and clear a single main agent - you end up doing work on top of a large context; burning tokens. If the work is delegated to subagents they have a fresh context which avoids this whilst improving their reasoning, which both improve token efficiency.

storystarling 11 hours ago | parent [-]

I've found the opposite to be true when building this out with LangGraph. While the subagent contexts are cleaner, the orchestration overhead usually ends up costing more. You burn a surprising amount of tokens just summarizing state and passing it between the supervisor and workers. The coordination tax is real.

AffableSpatula 11 hours ago | parent [-]

Task sizing is important. You can address this by including guidance in the CLAUDE.md around that ie. give it heuristics to use to figure out how to size tasks. Mine includes some heuristics and T shirt sizing methodology. Works great!

xpe 9 hours ago | parent [-]

Management is dead. Long live management.

stuaxo 10 hours ago | parent | prev [-]

If there's any kind of management some of it could use small local models - e.g. to see when it looks like its stuck.