| ▲ | csomar 5 hours ago | |
Holy AI, this shit is expensive. I was a bit suspect (no experience too) so I run some Claude calculations and it's also giving me a $350-450k to run GLM-5.2 at full precision (un-quantized). For rental on Azure, it's giving me $96–$144/hr. That translates to $22/M tokens which way more expensive than API pricing at z.ai. To get close to API pricing, you have to seek cheaper providers but that only gets you close to z.ai pricing not lower. Caveat here is that all of this is Claude math, but would be interested in someone more knowledgeable of the math chiming in. I was thinking that API pricing was highly inflated in order to cover subscription costs but with these calculations it might be not? | ||