Remix clone Hacker News

new | show | ask | jobs Github

	▲	Spartan-S63 2 hours ago
		Most variants of GPT-5.5 are less chatty and token-intensive than Opus 4.8/4.7, so despite the output token price being higher, it generates fewer tokens, so the net cost is lower. Per-token pricing is totally sensible from the provider-perspective on mapping COGS to revenue, but for a consumer, different models will produce more or less tokens, meaning the cost calculation is multi-dimensional.
	▲	manquer 2 hours ago \| parent [-]
		You can configure model to be terse/concise with output style ? There are plenty of popular projects like https://github.com/JuliusBrussee/caveman which do it for you even. Input/Cache/Output ratios are use case and configuration dependent . Any benefits in one model can usually be roughly to another with configuration tuning, and discussions devolve into subjective experience. Pricing sheet is the objective way to compare cost.