| ▲ | cbg0 7 hours ago | |||||||||||||
If it uses half the tokens to complete a task, then doubling the cost is perfectly fine. But is that actually true? | ||||||||||||||
| ▲ | 2001zhaozhao 7 hours ago | parent | next [-] | |||||||||||||
This happens with every new model release though. The model makes less mistakes and spends less time fixing them, resulting in a token usage reduction for the same difficulty of task. Almost any task other than straight boilerplate will benefit from this. In the same vein, I would guess that Opus 4.7 is probably cheaper for most tasks than 4.6, even though the tokenizer uses more tokens for the same length of string. | ||||||||||||||
| ||||||||||||||
| ▲ | jstummbillig 7 hours ago | parent | prev [-] | |||||||||||||
We'll find out! | ||||||||||||||