| ▲ | coffeefirst 5 days ago | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
> The cost for the same quality of output is going to drop at least 10x over the next 18-24 months. How do you know that? In 2026 the prices have been spiking. It now costs orders of magnitude more than it did in November. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | Ukv 5 days ago | parent | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Price of the current frontier may vary, but price for a given level of capability tends to drop pretty fast. April of last year you'd get 1431 ELO[0] from o3-2025-04-16 for $8.00 per million output tokens. April of this year you can get 1436 ELO from deepseek-v4-flash for $0.2 per million output tokens. [0]: https://huggingface.co/spaces/lmarena-ai/arena-leaderboard | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | onlyrealcuzzo 5 days ago | parent | prev | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
> How do you know that? Historic trends, every 18 months, performance for the same level of quality has gone down 90%. See: https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co... And Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart... And here: https://epoch.ai/data-insights/llm-inference-price-trends The technology already exists now on the algorithmic front for the next 10x drop between everyone adopting DeepSeek's MLA, MoE (mostly already done), Medusa (a better version of Google's speculative decoding), Kimi's Attn Residuals, and Mimo's Sliding Window Attn, and (possibly) Microsoft's 1.58b (this may be a nothing burger). Historically, algorithmic gains are only ~30% of the pie, but there's enough out there to get to 10x, with just what's available already. The other ~70% of the pie is better training data (often synthetic) and distilling frontier knowledge. There's no sign we are tapped out on that front. > In 2026 the prices have been spiking. That's not for the SAME level of output... | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | senordevnyc 4 days ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
It now costs orders of magnitude more than it did in November. Really? Care to do the math for me? Just curious about exactly how many orders of magnitude it's gone up. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||