coffeefirst an hour ago
Yes, that’s basically what I’m talking about. Without subsidies, there’s a lot of incentive to use more efficient/open weight models, but also to use them in ways that are less compute intensive: fewer tokens, shorter reasoning chains, less nonstop usage, and generally tying up less hardware for a lot less time.