| ▲ | martinald 4 hours ago | |
Very interesting. As I wrote in this article https://martinalderson.com/posts/is-the-ai-compute-crunch-he... a couple of weeks ago: "One thing I really suspect we'll see a lot more of is much more generous rate limits at 'off peak' times - likely to be early morning UTC - as there is no doubt a lot of 'idle' compute sitting there." I strongly suspect this will end up with the opposite happening, where peak tokens are far more "expensive" (whether through usage limits or API costs) than off-peak. PS: Anthropic have managed to improve reliability but are absolutely shredding Opus tok/s at peak times. It absolutely crawls on the web (maybe 2-3 tok/s?) and I believe that on non-Max plans it's also incredibly slow in Claude Code. | ||
| ▲ | Aboutplants 4 hours ago | parent [-] | |
"I strongly suspect this will end up with the opposite happening, where peak tokens are far more "expensive" (whether through usage limits or API costs) than off-peak." This only happens if and when competition eases up. Until then, it's a race to the bottom. | ||