happyPersonR 5 hours ago

more thinking == more tokens === more money LOL

overfeed 3 hours ago | parent | next [-]

Is there a cost benchmark out there? I wonder how frontier models are doing over time for cost per problem solved.

drob518 2 hours ago | parent | prev [-]

I think they are optimizing for one-shot performance because that will drive usage. They can’t afford to look bad in the benchmarks. And if that means consuming an order of magnitude more tokens, well, that’s good for business, too.