At the enterprise level though, its going to be hard to want to use a service in which costs are not predictable, and keeping those costs under control requires employee training.

▲

jochem9 7 hours ago | parent | next [-]

You can put a limit on token spend and provide training (and even pre-configured workflows) on how to limit token spend.

Like the other commenter said: cloud spend can also spin out of control if you don't pay attention, yet we've found ways to keep it under control (training, guardrails, limits, transparancy).

	▲	harimau777 42 minutes ago \| parent \| next [-]
		The problem that I see is what you do if someone runs out of tokens. It doesn't very well work to say "well I guess you just get fired because you can't work at full speed for the rest of the month". Personally, this feels like its just trying to push the work of managers in allocating resources onto developers so that they have more work to do and can be blamed if anything goes wrong.
	▲	darig 3 hours ago \| parent \| prev [-]
		[dead]

▲

mrgoldenbrown 11 hours ago | parent | prev | next [-]

>...use a service in which costs are not predictable, and keeping those costs under control requires employee training.

Isn't this a (mildly exaggerated) description of AWS, which is a very successful service?

▲

noodletheworld 5 hours ago | parent [-]

Mmm… but for AWS its pay for external use right?

So your costs scale with the number of users you have.

Thats an op ex that you can explain.

For tokens for developers its maybe closer, cost/outcome wise, to hiring an external consulting company to write your code; money paid scales with work done, no promise of delivery, arbitrary unpredictable external price changes.

Its not quite the same; though, similarly lucrative for consultants.

	▲	logicchains 3 hours ago \| parent [-]
		>Mmm… but for AWS its pay for external use right? Not if you're using it for running builds, running research jobs, model training, etc.

▲

sidewndr46 16 hours ago | parent | prev | next [-]

Am I losing my mind, aren't there multiple headlines each day about companies penalizing employees for not using AI enough?

▲

iSnow 16 hours ago | parent | next [-]

That was roughly 3 weeks ago, with the reprising of Claude 4.7 and GPT 5.5, things have become more spicy.

▲

foolserrandboy 3 hours ago | parent | next [-]

2 months ago: no limits. 1 month ago we had a leaderboard for whoever had the highest token spend not taking into account what was actually produced. This week: “everyone is using opus too much, just use it for planning.”

▲

sidewndr46 15 hours ago | parent | prev [-]

use AI, don't use AI, this whole thing is getting really hard to follow

	▲	andrekandre 12 hours ago \| parent \| next [-]
		i've worked at so many places where the propaganda/marketing and reality on the ground is so disorienting/shocking i don't really expect this to be any different...
	▲	lukan 3 hours ago \| parent \| prev [-]
		It is allmost as if humans ain't of a single mind.

▲

basch 15 hours ago | parent | prev [-]

since those headlines started ive felt it just encouraged inefficiency. "say as much as you can without saying anything." if you were accomplishing your task the need for more would end, thus there is incentive to never succeed.

▲

layer8 17 hours ago | parent | prev | next [-]

To be fair, the cost of software development has always been fairly unpredictable. What may be different is that the cost used to be roughly proportional to man-hours spent, while now the number of agents running in parallel may be less predictable.

▲

ilovecake1984 16 hours ago | parent | next [-]

The cost per month is 100% known and always has been. What has been variable is the rate of delivery. AI is different and can be substantial in countries with lower wages.

▲

xienze 16 hours ago | parent | prev [-]

> To be fair, the cost of software development has always been fairly unpredictable.

Yes, but in a "oops this is gonna take another two months to finish" kind of way, not the "oops this is the 12th time this month 8 developers have burned $2K in tokens in a single day and no one really knows how it happened" kind of way.

▲

kridsdale1 16 hours ago | parent [-]

We’re all being given belt-loaded machine guns and tossed on to Planet K. We used to pay for the salaries of soldiers, now we have an Ammo Budget.

▲

dgellow 6 hours ago | parent [-]

A belt loaded spinwheel machine gun, where there are some chances the next bullet is a dummy round, or goes in the wrong direction. And everytime you reload a new soldier is in charge of the gun

	▲	bluGill 2 hours ago \| parent [-]
		You don't need that analogy as the normal use of a automatic gun in war is not to kill, it is to suppress - stop the enemy from moving. If you are hit by a gun in automatic mode it is your own stupid fault. When you want to kill someone you switch to one shot or maybe 3 round bursts.

▲

salawat 18 hours ago | parent | prev [-]

There's no fucking training to mitigate a slot machine.

▲

subscribed an hour ago | parent | next [-]

LOL, that's a sophisticated and sometimes slightly unpredictable multitool.

If this is the "analogy" you go for, you don't seem to be suited to make that comparison.

▲

LPisGood 4 hours ago | parent | prev | next [-]

There’s actually been a ton of research on how to optimize “slot machines,” at least in a generalized sense. For more reading, check out the literature on multi armed bandits.

▲

serf 4 hours ago | parent | prev | next [-]

that analogy is so boring now with so many real world examples of actual LLM work.

people still can't get over the unreasonable effectiveness of algorithms.

	▲	arkadiytehgraet 32 minutes ago \| parent \| next [-]
		There have also been winners of a slot machine gamba, so the analogy quite holds. I would even argue that there are considerably more slot machine gamba winners than the real world examples of actual LLM work.
	▲	greenchair 2 hours ago \| parent \| prev [-]
		nondeterminism will always be anathema to the engineering mind

▲

dgellow 6 hours ago | parent | prev [-]

Games like Diablo are basically a whole bunch of slot machines, and there are strategies you can follow to optimize your run.

▲

gambiting 5 hours ago | parent [-]

Yes, because in video games there is always a chance to win so you can optimize your strategy around that chance. If you have a 1% chance to drop a legendary weapon, the question becomes how do I manufacture 100 chances for a weapon drop in the shortest possible time. With agentic coding there is no such guaranteed chance - in a way it's worse than a slot machine that is guaranteed to pay out eventually. You could spend hundreds of millions of tokens and still not get what you asked for.

▲

dgellow 32 minutes ago | parent | next [-]

You’re right, the arpg analogy isnt great, it’s too simplistic. I was trying to come up with something heavily stochastic where people are coming up with strategies to get the odds in their favor. Maybe closer to speculating on the real estate market? But even that feels too simplistic compared to LLMs. Even the definition of a win isn’t well defined.

Actually it’s really its own thing, I don’t think the slot machine analogy works too well, you also have fixed odds (and you know they aren’t in your favor), and a binary output

▲

echoangle 4 hours ago | parent | prev [-]

> If you have a 1% chance to drop a legendary weapon, the question becomes how do I manufacture 100 chances for a weapon drop in the shortest possible time.

Sidenote but I hope everyone realizes that 100 is kind of arbitrary here and does not mean the total chance to to get something is 100%.

	▲	avadodin 2 hours ago \| parent [-]
		you don't have to do the math unless it's on the exam, lol.