sathackr 4 hours ago
The opposite of that has been happening for 20 years now with cloud compute. It won't happen with AI models either.

It's almost ingrained in the American business model now. Outsource everything. Nobody wants to manage a room full of servers when they can spend 2-3x as much and outsource that headache along with the responsibility for it. Same will happen with AI, whether that means paying Anthropic that premium or paying AWS.

I'm in a relatively small business, and we recently had an outage related to our local infrastructure. I got pressure from the CEO saying it wasn't reliable to host our own infrastructure anymore, even though our total internal downtime over the last 5 years is significantly less than even a single one of the larger recent AWS outages.

Everyone wants to shuck the chore and the responsibility.
preommr 3 hours ago
> The opposite of that has been happening for 20 years now with cloud compute. It won't happen with AI models either.

AI is different.

Cloud computing genuinely is cheaper on average. It's better than paying for Cisco servers, and at scale, it's cheaper than managed platforms (à la Heroku), and it's a coin toss when you're in the middle ground and constantly approaching the point of rebuilding poor man's versions of existing products but with very, very expensive engineering salaries.

In contrast, local models offer dramatic savings, and are orders of magnitude better in certain aspects, like stability: the performance is all over the place with traditional AI companies as they divert compute to their next big thing.

The benefits to maintaining your own infrastructure are pretty moderate to low, with very high risk. And also, alternate models are pretty easy to use and easy to swap out, unlike the vendor lock-in that exists with cloud services.
TkTech 3 hours ago
For many companies (country-dependent) that's not really why they use cloud services vs. purchasing. It's tax shenanigans and business process overhead: OpEx vs. CapEx, and the choice between a small (%) bump in the huge AWS bill that no one will even notice and a $30k+ invoice for hardware that has to go through rigorous review and 3 departments.

Same reason people pay for things through the AWS marketplace (like Vanta) instead of having to go through their invoicing process.
dreambuffer 3 hours ago
It's just not comparable though, is it? You need cloud services because it's physically impossible to use your single home computer as a server, CDN, load balancer, mass storage, security service, and distributed system. But AI is just weights: you can run a reasonably intelligent model at home, or on a few GPUs if you're a small-to-medium-sized company, and it doesn't require dedicated maintenance.
cheema33 3 hours ago
> I got pressure from the CEO saying it wasn't reliable to host our own infrastructure anymore even though our total internal down time over the last 5 years is significantly less than even a single of the larger recent AWS outages.

Same here. My job as a software dev does not require me to self-host the services we need and use. Quite the opposite. But I am reluctant to hand over all control to AWS or equivalent for several reasons that I will get into here.

I have found that Infrastructure as Code (IaC) and modern tools like OpenTofu and Ansible, combined with frontier AI models and harnesses, give you superpowers in this space. Almost all of our self-hosted services are fully managed by these tools. For example, we perform backups and test them more often now than we ever did before, entirely because it is so much easier to do all of that now.
Terr_ an hour ago
IMO local-vs-cloud may be a misleading dichotomy, versus:
There are some important differences between 3 and 4 in terms of data privacy and security.
derfurth 3 hours ago
That's an interesting take; however, there is no ongoing maintenance related to local models. Maybe the only effort is giving more capable machines to the workforce, but yeah, I can see how it might feel like a barrier.
davidw 3 hours ago
Still though, perhaps the existence of low-margin, generic, cloud LLMs puts some downward pressure on the 'brand name' companies?
CamperBob2 2 hours ago
> outsource that headache along with the responsibility for it

You know what gives me headaches? When I'm in the middle of a session and the model gets rug-pulled out from under me because somebody at the model provider didn't pay the Trump bill that month. Or when someone at the model provider decides that the curve-fitting algorithm in my graphics package looks a little too much like Skynet for comfort. Or when they do any number of other things to undermine my work for the sake of their business model, some of which I won't even notice until the damage is done.

The sad thing is, if you know how inference works, you know that it really is insanely wasteful for everybody to run it locally. If anything naturally belongs in the cloud, it's inference. But at the same time, what choice are we being given?