| ▲ | Spartan-S63 2 hours ago | |
Most variants of GPT-5.5 are less chatty and token-intensive than Opus 4.8/4.7, so despite the output token price being higher, it generates fewer tokens, so the net cost is lower. Per-token pricing is totally sensible from the provider-perspective on mapping COGS to revenue, but for a consumer, different models will produce more or less tokens, meaning the cost calculation is multi-dimensional. | ||
| ▲ | manquer 2 hours ago | parent [-] | |
You can configure model to be terse/concise with output style ? There are plenty of popular projects like https://github.com/JuliusBrussee/caveman which do it for you even. Input/Cache/Output ratios are use case and configuration dependent . Any benefits in one model can usually be roughly to another with configuration tuning, and discussions devolve into subjective experience. Pricing sheet is the objective way to compare cost. | ||