▲ | tempoponet 2 days ago | |
Once local models are good enough there will be a $20 cloud provider that can give you more context, parameters, and t/s than you could dream of at home. This is true today with services like groq. | ||
▲ | theshrike79 a day ago | parent | next [-] | |
Anthropic used to have unlimited subscriptions, then people started running angents 24/7. Now they have 5 hour buckets of limited use. Groq most likely stays afloat because they're a bit player - and propped by VC money. With a local system I can run it at full blast all the time, nobody can suddenly make it stupid by reallocating resources to training their new model, nobody can censor it or do stealth updates that make it perform worse. | ||
▲ | sunir 2 days ago | parent | prev | next [-] | |
Not exactly. Those models are based on intermittent usage. If you're using an AI engineer using a sophisticated agent flow, the usage is constant and continuous. That can price to an equivalent of a dedicated cube at home over 2 years. I had 3 projects running today. I hit my Claude Max Pro session limits twice today in about 90 minutes. I'm now keeping it down to 1 project, and I may interrupt it until the evening when I don't need Claude Web. If I could run it passively on my laptop, I would. | ||
▲ | hatefulmoron 2 days ago | parent | prev [-] | |
Groq and Cerebras definitely have the t/s, but their hardware is tremendously expensive, even compared to the standard data center GPUs. Worth keeping in mind if we're talking about a $20 subscription. |