▲ | strictnein 5 days ago | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Confused on the Max 5x vs Max 20x. I'm on the latter, and in my email it says: > "Most Max 20x users can expect 240-480 hours of Sonnet 4 and 24-40 hours of Opus 4 within their weekly rate limits." In this post it says: > "Most Max 5x users can expect 140-280 hours of Sonnet 4 and 15-35 hours of Opus 4 within their weekly rate limits." How is the "Max 20x" only an additional 5-9 hours of Opus 4, and not 4x that of "Max 5x"? At least I'd expect a doubling, since I'm paying twice as much. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | deviation 4 days ago | parent | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
This makes sense if we compare compute cost instead of hours. Transformer self-attention costs scale roughly quadratically with context window size. Servicing prompts in a 32k-token window uses much more compute per request than in an 8k-token window. A Max 5× user on an 8k-token window might exhaust their cap in around 30 hours, while a Max 20× user on a 32k-token window will exhaust theirs in about 35 to 39 hours instead of four times as long. If you compact often, keep context windows small etc, I'd wager that your Opus 4 consumption would approach the expected 4× multiplier... In reality, I assume the majority of users aren't clearing their context windows and just letting the auto-compact do it's thing. Visualization: https://codepen.io/Sunsvea/pen/vENyeZe | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | thomasfromcdnjs 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Would love more feedback on this, I will definitely downgrade from Max 20x if it is the case. Cost me $350 a month in Australia... | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | akmarinov 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
I upgraded to 20x because i was constantly running against Opus limits and now it seems the 20x is almost equal to the 5x in that regard | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | lvl155 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
This is why I stopped using the MAX. Downgraded to Pro and started using o3 and others via API. I really don’t need that many hours to game plan in the beginning. At most it will cost me $10 between o3, Gemini, and Opus per project. There are new model releases every couple of weeks and I’d hate to get stuck with just one provider. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | ImaCake 4 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The ambiguity here is awful marketing practice. This bitter pill would be much easier to swallow if it was a hard number instead of these vague ranges. It would serve Anthropic better too - telling people they only get 300hrs vs between 240-480 (which they will naturally evaluate as 240hrs) will mean less users leaving the platform. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | dawnerd 4 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
They really need to to just a limit so you can see how much you've used, not some vague hours per week or whatever. Github copilot will tell you, you have 300 requests with sonnet a month, makes it really easy to know when you're blowing past without having to worry about how long something has run. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | yobid20 4 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Someone should do a study then file a class action if their marketing material is false. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | foota 5 days ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
You're paying for prioritization during high traffic periods, not for 2x usage. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|