The abrupt swing in many non-technology company IT departments from "hey developer, you aren't using enough tokens" to this is just too funny.

And I'm seeing almost no self-awareness from leaders. They are making decisions about things that they just don't understand. And are completely unworried about it. Just blindly following whatever the news cycle is about AI.

▲

datakan 34 minutes ago | parent | next [-]

The closer people live to the consequences of their decisions the more rational they become. Until leaders(and I use that term loosely) are held accountable, the insanity will continue.

	▲	greesil 24 minutes ago \| parent \| next [-]
		Their only accountability is to the stock price. The insanity will continue.
	▲	20 minutes ago \| parent \| prev [-]
		[deleted]

▲

qoez 32 minutes ago | parent | prev | next [-]

I feel like most successful businesses have such a moat of required capital to compete with them that even tho in theory poor decisions like this is supposed to give opportunities for entreprenuers to hit when the big dogs make a wrong move, it doesn't end up happening.

▲

sdeframond 19 minutes ago | parent | prev | next [-]

Groups resist to change - the bigger the group, the most resistance there is.

As a leader, pushing for rapid change cannot really be nuanced lest the push dissipates into the organization's entropy.

▲

onlyrealcuzzo 35 minutes ago | parent | prev [-]

The actual cost is going to drop 99% in ~4 years.

How much that makes it into enterprise pricing is TBD, since none of the hyper scalers are making money yet of selling AI inference.

Almost all businesses are ahead of the gun. For most of their use cases, AI is either not yet good enough on its own, or good enough but too expensive.

No one wants to get left behind, so everyone's trying to get onto it now, even though it's not ready for what most enterprises want to do with it.

It's easy for them to look at a small startup without billions of lines of legacy business logic debt and see them having success and wonder why they can't have just as much - or more - why they're bigger so they should have better and more success, right???

Wrong...

But when it gets ~99% cheaper for local inference over the next 4 years, at the same time the price per watt improve 4x -> a lot of those cases will start to pencil out.

▲

krona 30 minutes ago | parent | next [-]

> The actual cost is going to drop 99%

Do you mean the marginal cost by the producer, or the cost on the consumer? I can't see the price of electricity falling much, and the demand curve is apparently exponential if the hype is to be believed.

▲

packetlost 30 minutes ago | parent | prev | next [-]

I don't see how this is even remotely true. Unless there's some super breakthrough into a fundamentally different architecture, there's not really a path to a 50% reduction in price, much less a 99% reduction.

▲

datakan 33 minutes ago | parent | prev | next [-]

What makes you think prices will drop? Everyone I’ve spoken to believes they will only skyrocket. Genuinely curious

▲

onlyrealcuzzo 28 minutes ago | parent [-]

The technology already exists now for the next 10x drop between everyone adopting DeepSeek's MLA, MoE (mostly already done), Medusa (a better version of Google's speculative decoding), Kimi's Attn Residuals, and Mimo's Sliding Window Attn, and (possibly) Microsoft's 1.58b (this may be a nothing burger).

Historic trends, every 18 months, performance for the same level of quality has gone down 90%.

See: https://www.reddit.com/r/LocalLLaMA/comments/1gpr2p4/llms_co...

And Chart 13 here: https://www.rdworldonline.com/ais-great-compression-20-chart...

And here: https://epoch.ai/data-insights/llm-inference-price-trends

Historically, algorithmic gains are only ~30% of the pie, but there's enough out there to get to 10x, with just what's available already. The other ~70% of the pie is better training data (often synthetic) and distilling frontier knowledge. There's no sign we are tapped out on that front.

Additionally, GRAM (from ~10 days ago) is likely to be a 5-10x on its own (if not substantially more for smaller models).

Further, that's not even counting that cost per watt is still dropping ~2x every 2 years on its own.

The human brain is still 8-10 orders of magnitude more efficient than the best LLMs of today. With ~1/10th of global capex riding on AI, if you don't think they're going to knock of 2 orders of magnitude more, when it's this obvious and easy... I don't know what to tell you...

	▲	datakan a minute ago \| parent [-]
		This is great food for thought, thank you

▲

bakugo 30 minutes ago | parent | prev [-]

Prices have been very obviously trending up, not down. Even open weights models are becoming more expensive with every release. Computer hardware is ballooning in price.

	▲	abalashov 2 minutes ago \| parent [-]
		Just wait for the next model and the next model architecture. Just wait for it, bro.