I get about 35-40 tok/sec on a 3090.
It's actually about the same speed once you account for how much more responsive my system is compared to Anthropic's SaaS infrastructure.