Remix.run Logo
Spartan-S63 2 hours ago

Most variants of GPT-5.5 are less chatty and token-intensive than Opus 4.8/4.7, so despite the output token price being higher, it generates fewer tokens, so the net cost is lower.

Per-token pricing is totally sensible from the provider-perspective on mapping COGS to revenue, but for a consumer, different models will produce more or less tokens, meaning the cost calculation is multi-dimensional.

manquer 2 hours ago | parent [-]

You can configure model to be terse/concise with output style ? There are plenty of popular projects like https://github.com/JuliusBrussee/caveman which do it for you even.

Input/Cache/Output ratios are use case and configuration dependent . Any benefits in one model can usually be roughly to another with configuration tuning, and discussions devolve into subjective experience.

Pricing sheet is the objective way to compare cost.