toss1 2 hours ago

Doesn't the alignment sort of depend on who is paying for all the tokens?

If Dave the developer is paying, Dave is incentivized to optimize token use along with Anthropic (for the different reasons mentioned).

If Dave's employer, Earl, is paying and is mostly interested in getting Dave to work more, then what incentive does Dave have to minimize tokens? He's mostly incentivized by Earl to produce more code, and now also by Anthropic's accidentally variable-reward coding system to code more...?