| ▲ | eggbrain 3 hours ago |
| For those of us on subscription plans: * From today through June 22, Fable 5 is included on Pro, Max, Team, and seat-based Enterprise plans at no extra cost. * On June 23, we’ll remove Fable 5 from those plans. Using it after that will require usage credits. If capacity allows, we’ll extend the included window. * After this point—when sufficient capacity allows us to do so—we aim to restore Fable 5 as a standard part of subscription plans. We intend to do this as quickly as we can. The "offer, then remove" aspect is a bit eyebrow-raising -- it feels like they are trying to get subscribers to switch to usage-based billing, which makes me wonder if we'll ever get it after that June 22nd window. |
|
| ▲ | jrflo 2 hours ago | parent | next [-] |
| Still satisfied with my switch to codex/chatgpt. I couldn't imagine switching away from claude code when it first launch but with the drastically more generous usage on codex for the same subscription tier I just can't justify it. |
| |
| ▲ | goranmoomin an hour ago | parent | next [-] | | My experience is that the GPT-family of models are very smart and figure out bugs, edge cases a bit better, but it produces code that is much less mergable – if you review the code, it introduces a lot more useless/inappropriate heavy abstractions and wrapper functions, compared to the Claude-family models which introduces the right amount of straightforward human-style code. I can recognize so much of the GPT/Codex generated code long after it gets merged (not by me). Additionally, the time spent on every agent turn on GPT 5.5 is much longer compared to Claude Opus 4.8, which means iterating on the code takes a lot more patience, and there's a lot more nitpicks to pick when actually using GPT 5.5 to do software engineering. Feels like GPT-style models are more geared on doing one-shot software vibing (and handling the vibe coded mixture) compared to Claude's focus on actual software maintenance. I got a GPT Pro sub for free and wanted to cancel my Claude subscription so much, but I still keep reaching Claude models a lot more. Frustrating. | | |
| ▲ | PhilipDaineko 7 minutes ago | parent | next [-] | | "5. DON'T FUCKING OVERENGINEER! WRITE THE SIMPLEST CODE THAT CAN POSSIBLY WORK! NO NESTED LAYERS OF ABSTRACTION! NO UNNECESSARY CLASSES OR METHODS! NO DESIGN PATTERNS UNLESS THEY ARE ABSOLUTELY NECESSARY! NO MAGIC! NO SHENANIGANS! JUST THE DAMN CODE THAT GETS THE JOB DONE IN THE MOST STRAIGHTFORWARD WAY POSSIBLE! THE FIRST PRIORITY IS TO WRITE CODE THAT IS EASY TO READ AND UNDERSTAND AND READ!!!" this is the line I keep in Agents.md that helps me prevent Codex from playing smart | | |
| ▲ | bertil a minute ago | parent [-] | | The urge to put capitalized, repetitive, borderline abusive instructions should be studied. I haven't read many academic papers looking at the frustrations around repetitive patterns. |
| |
| ▲ | superkickstart 32 minutes ago | parent | prev | next [-] | | I'm not sure if i do something differently but i have the exact opposite experience with these models. Claude always feels like it's generating way too overdesigned and hard to understand code with the vibe oriented feel while codex is cleaner and more "task at hand" and easier to work with. | |
| ▲ | dilap 27 minutes ago | parent | prev | next [-] | | Have you tried iterating on style feedback in AGENTS.md? I've been reasonably successful using this to get it to output code in a terse, non-defensive style that matches my hand-written code. | |
| ▲ | vruiz 28 minutes ago | parent | prev [-] | | This is my experience as well. I have defined a CLAUDE.md rule to ask codex to automatically code review, and I tell it that the reviewer is very picky and to only implement what it considers valuable feedback. I hope they don't converge over time, currently, in combination they works really well. |
| |
| ▲ | sigbottle 2 hours ago | parent | prev | next [-] | | Codex IME is just smarter, I think it shows given both anecdotes but also how OpenAI has always been at the front of programming competitions and math problems. But Claude models seem to be better at long term problems or more ambiguous problems. I'm curious as to what the primary benefit here. Are there secret improvements in training? There hasn't been much in fundamental model architecture, I don't think. What about harnesses? I wonder what's pushing the AI. It seems like harnesses is the main thing pushing AI ever since CoT. | | |
| ▲ | Spartan-S63 an hour ago | parent | next [-] | | I find that OpenAI's agentic tools and models are better for building human-maintainable software. Meanwhile, Anthropic seems to be cosplaying Apple while missing out on all the exceptional engineering required to create something that polished. Their admission of predominately using Claude with little human oversight and their stealth mode is an indictment of a poor engineering culture, from what I can surmise. | | |
| ▲ | someguyiguess an hour ago | parent [-] | | Serious question: what is the secret to getting Codex to write decent code? I am on Windows. Maybe that is the issue, but I can't seem to get Codex to function anywhere near the level that I was previously able to get with even Claude Sonnet. Does Codex just not work well with Windows yet? |
| |
| ▲ | someguyiguess an hour ago | parent | prev | next [-] | | I've had the exact opposite experience. For various reasons, I've had to move from Claude to Codex and the rate at which it burns tokens for the same output I would get from Claude is ridiculous. I'm probably burning tokens at a rate that is at least twice as much as I was when using Opus 4.5 for coding tasks and still finding that just manually coding is easier than trying to get Codex to write functional code. | |
| ▲ | greenavocado an hour ago | parent | prev [-] | | How smart a model is varies hour over hour, tracked over here: https://aistupidlevel.info/ |
| |
| ▲ | wsatb 2 hours ago | parent | prev | next [-] | | I guess enjoy it while it lasts? OpenAI won't be able to subsidize that forever either. | | |
| ▲ | windexh8er an hour ago | parent | next [-] | | Agreed. I think the Chinese labs are proving that OpenAI and Anthropic don't have a moat in almost every aspect, especially pricing. I also think people are getting annoyed with the constant lift and shift. I've seen more folks drop Claude Code and Codex, specifically, because of the lock-in it provides the providers. I'm curious to see how people standardize on tooling adjacent and if Anthropic, Google or OAI move to block utilization akin to the games Anthropic has been playing as of late. I think the end game is routed model usage and SLMs. I think Apple is going to prove this in the consumer space pretty handily and I'm curious how the Android ecosystem responds since the hardware is considerably lacking in model performance. I think Apple has a huge opportunity here, as much as I don't like their current ecosystem of walled garden. They did position themselves very well with ARM and custom chips for their hardware. Hopefully the broader ecosystem of ARM and Linux are able to make some headway and we see a more formalized, and broadly accepted, architecture to capitalize on. | | |
| ▲ | Qhemlomo a minute ago | parent | next [-] | | Big companies are not doing OpenRouter. My company has an agreement with the big providers and while i'm pretty sure they think about how to get budget back, its an competitive advantage and normal people will not learn different model behaviours. At least for now. | |
| ▲ | maxdo an hour ago | parent | prev [-] | | I see exactly opposite . Chinese models fails under any complex scenarios, while us labs raise the price , that's a sign of confidence. | | |
| ▲ | re-thc 37 minutes ago | parent [-] | | > while us labs raise the price , that's a sign of confidence Regardless of what others are doing, US labs here are just rushing to IPO. It's NOT a sign of confidence. It's the equivalent of saying you have confidence in SpaceX making revenue by renting out their data center (instead of their AI making bank). |
|
| |
| ▲ | pyeri 20 minutes ago | parent | prev | next [-] | | My bet is they'll keep subsidizing for a considerable period of time, at least 1-2 decades more. Most AI companies are just testing the waters with paid tiers right now, their greatest fear with increased pricing is folks reverting back to wikipedia, stack-overflow and other public domain organic activity buzzing back to life; that will kill any RoI potential in LLMs forever. They're playing the wait game instead, observing how the digital sphere reacts to every little increase in price. If that weren't the case, they'd be pricing at lucrative premiums and even gotten away in short-term considering the increased dependency in the enterprise world. But that'd be like killing for the golden egg too soon and losing all long-term potential. Once the folks are so addicted to LLMs that even writing a hello world program sounds like a nightmare and coming up with an article draft feels like reinventing Egyptian glyphs, that's when the real pricing hammer will come. | | |
| ▲ | wsatb 10 minutes ago | parent [-] | | Anthropic and OpenAI won't be around in 1-2 decades if this is their long term plan. People are not going to revert, but go elsewhere. China is proving that it can be done cheaper. |
| |
| ▲ | flatline 2 hours ago | parent | prev | next [-] | | I don't think anyone has a firm grasp on actual inference costs -- including the research and training that has gone into those models. We've got near-frontier capabilities from open source models from China at pennies on the dollar compared to US big tech rollouts. OpenAI and Anthropic are heavily subsidizing their inference -- no wait, they are charging the most they can get away with before going public. Where is the truth? | | |
| ▲ | andrewmutz 2 hours ago | parent | next [-] | | Both can be true. They can be charging what the market will bear, and still be charging less than their costs of running it. | | |
| ▲ | wyre an hour ago | parent [-] | | There is no way I'm believing DeepSeek can charge less than $1 USD for their pro model while Opus costs over 25x more, yet their price is less than the cost of running it? | | |
| ▲ | kube-system 4 minutes ago | parent [-] | | China subsidizes strategic industries. And DeepSeek specifically has said they have no commercialization plans. |
|
| |
| ▲ | InsideOutSanta an hour ago | parent | prev | next [-] | | > I don't think anyone has a firm grasp on actual inference costs -- including the research and training that has gone into those models We know roughly how much these companies spend and what their revenues are. Based on that, they'd have to more than double revenue (without spending more money) just to stay even, and that's not good enough given how deep in the hole they are. > OpenAI and Anthropic are heavily subsidizing their inference -- no wait, they are charging the most they can get away with before going public. Where is the truth? Both are true. I mean, I'd be willing to spend a bit more than I do now, but not more than double, and neither are most companies. The company I work for is currently investigating how to reduce LLM spend, not looking to spend more. | |
| ▲ | dontlikeyoueith an hour ago | parent | prev | next [-] | | > OpenAI and Anthropic are heavily subsidizing their inference -- no wait, they are charging the most they can get away with before going public. Where is the truth? Both. They are charging the most they can get away with and that amount is still heavily subsidized by VC capital. | |
| ▲ | pimeys an hour ago | parent | prev | next [-] | | We pay by token at work. I just finished one session with Opus that was 4000 dollars. In about three days. Now that 200USD subscription starts to feel cheap... | | |
| ▲ | zozbot234 an hour ago | parent | next [-] | | That would be about ~300 tok/s over 72 hours at Claude Fable output token prices? I'm not sure that this passes a sanity test. | | | |
| ▲ | rubyn00bie 38 minutes ago | parent | prev [-] | | Just outta curiosity, as I’ve never gotten a spend anywhere near that, what variant were you using? Like max context window and fast mode? Or was it just chugging along non stop for three days? | | |
| ▲ | pimeys 25 minutes ago | parent [-] | | Fast mode max content window. The task was: replace all 1600+ queries from one database to another and make the whole integration test pass. We did multiple passes, with different concerns when changing from database to another. My OpenCode session right now says $4,365.02. I haven't gotten close to this either before, but now we wanted to move fast because this branch gets conflicts all the time and we want to get over with the migration asap. |
|
| |
| ▲ | schaefer an hour ago | parent | prev | next [-] | | > I don't think anyone has a firm grasp on actual inference costs. There are huge numbers of users (myself included) that do have an exact idea of what inference costs on open models. Because we can buy tokens from 3rd parties that have no motivation to subsidize our use. That's to say, there's a fair marketplace[1] and we're hanging out there. If you want to say "I don't think anyone has a firm grasp on actual inference costs on these proprietary/closed models", then I could agree with that. [1]: https://openrouter.ai/rankings#leaderboard | |
| ▲ | logicchains an hour ago | parent | prev | next [-] | | We have a firm grasp on actual inference costs from the various open weights model providers on OpenRouter. They don't have the money to subsidize inference and it's quite a competitive market, so the prices are representative of the costs. | |
| ▲ | MichaelMedbed 2 hours ago | parent | prev [-] | | [flagged] | | |
| ▲ | kllrnohj 2 hours ago | parent | next [-] | | regardless of whether that's true or not, US companies doing hosted inference of the models coming out of China are also significantly cheaper than those from OpenAI or Anthropic | |
| ▲ | polski-g 2 hours ago | parent | prev [-] | | Not relevant to the post. |
|
| |
| ▲ | ChrisMarshallNY 2 hours ago | parent | prev | next [-] | | I'm planning on switching from the $20/month to the $100/month plan. It's worth it, and I can afford it, but I am not really the right type of user for token-based usage. It's all for personal and free work. | | |
| ▲ | micah94 an hour ago | parent | next [-] | | Just a personal anecdote but I have not hit any more thresholds or limits since switching to the MAX plan and so far, it's been worth it. But I do wonder how long even this will last... | | |
| ▲ | ygjb an hour ago | parent [-] | | I think subscription models are sustainable, but longer term, we should probably expect to see more prompt optimization happening in the providers inference pipeline. For example, unless you explicitly tell the agent or API to use a specific model, fronting the inference layer with a caching prompt classifier to determine which model to use, and automatically select the lowest cost model would probably already save alot of money (IDK if Claude/OpenAI do this on the backend, but several services I have worked on do some things like this to reduce costs of delivery customer facing inference at scale). | | |
| ▲ | Majromax an hour ago | parent | next [-] | | > fronting the inference layer with a caching prompt classifier to determine which model to use, and automatically select the lowest cost model would probably already save alot of money Unfortunately, that doesn't work within a single session. The K-V cache of a model is intertwined with the model's configuration. Switching models invalidates the cache, meaning everything up to the point of the switchover is processed like a new, uncached input token. Per Anthropic's pricing doc, an Opus 4.8 cache hit costs 50¢/MTok, while Haiku costs $1/MTok for uncached input. Model selection works best if sessions are short and self-contained, particularly if the first few interactions can reliably classify the model need. That probably covers most 'support chatbot' use-cases, but it doesn't describe the kinds of heavy agentic automation that really chews through token budgets. | | |
| ▲ | zozbot234 38 minutes ago | parent | next [-] | | > The K-V cache of a model is intertwined with the model's configuration. I don't think this is true if you simply quantize the model or run it with fewer active experts? The underlying weights would stay the same. You could also play further tricks with skipping some of the model's middle layers outright, which works surprisingly well due to how skip connections are used. | |
| ▲ | ygjb an hour ago | parent | prev [-] | | There is a definite financial incentive for people smarter than me to solve the problem, and I don't generally bet against businesses finding ways to reduce costs :) |
| |
| ▲ | wahnfrieden an hour ago | parent | prev [-] | | ChatGPT does this and codex will eventually. They’ve stated it’s the future. |
|
| |
| ▲ | rnxrx an hour ago | parent | prev [-] | | I have the $100 plan and had almost never run out of credits until I started using the ultracode / workstreams feature w/Opus 4.8..at which point I managed to blow the full 6 hour allocation in like 20 minutes, or so. In fairness, it did some amazing things with the extracted information, but it also strongly suggested that I'd need the $200 subscription *plus* a budget for extra usage. |
| |
| ▲ | andai 2 hours ago | parent | prev | next [-] | | A few weeks ago they massively cut usage on free tier. | |
| ▲ | gck1 an hour ago | parent | prev [-] | | Nothing is subsidized. Subscriptions are profitable for both Anthropic and OpenAI. Anthropic wanting to switch billing to API rates is them just wanting to generate more profit. | | |
| ▲ | InsideOutSanta an hour ago | parent | next [-] | | > Nothing is subsidized. Subscriptions are profitable for both Anthropic and OpenAI. Even if subscriptions are locally profitable (i. e., the cost of the subscription covers the cost of inference), they're still subsidized because they don't cover training and running the company; otherwise, these companies would be profitable. | | |
| ▲ | gck1 40 minutes ago | parent [-] | | I can see that being true, and it very likely is true. But isn't infinite VC money and no incentives to optimize operations the reason behind that? Take a look at China for example - they have no access to NVIDIA, so they're trying to build their own hardware, they have no unlimited funding, so they try to optimize things. And Anthropic is complete opposite of that - if NVIDIA were to triple their prices tomorrow, Anthropic would still pay them. In the end, either we all somehow go mad and start paying Anthropic tens of thousands of dollars per month so support this madness, or we will go with whoever isn't lighting cash on fire. | | |
| ▲ | re-thc 30 minutes ago | parent [-] | | > Take a look at China for example - they have no access to NVIDIA Not true. Stop following US media spam if needed. 1. Very recently, the US did close a loophole on sanctions that allowed Chinese companies to use NVIDIA hardware outside of China i.e. before that was closed they all had access. The trick was train outside, do adjustments, ship the disks back and use non-NVIDIA in China, but at least the training and endpoints not hosted in China could all use NVIDIA. 2. There's been plenty of reports including fines and bans e.g. to Supermicro on smuggling NVIDIA hardware to China. I doubt it has been stopped. You can't catch everyone. |
|
| |
| ▲ | wsatb an hour ago | parent | prev | next [-] | | "Nothing is subsidized" is a wild take. They might be making money on some users, perhaps even most users, but certainly not all. Also, "subsidized" doesn't just mean on compute. | |
| ▲ | y1n0 an hour ago | parent | prev | next [-] | | That's interesting. Do you have anything to back that claim up? | | |
| ▲ | gck1 an hour ago | parent [-] | | I do, and it's called DeepSeek's pricing table. At the same time, "subscriptions are subsidized" cohort have no data whatsoever, and yet they're in every thread. Granted, it could still mean that Anthropic just chooses to lose money - but that's Anthropic's choice. DeepSeek has proven that inference can be much, much cheaper than what Anthropic advertises on their API rates page. | | |
| ▲ | nickthegreek 34 minutes ago | parent [-] | | > Granted, it could still mean that Anthropic just chooses to lose money - Then the cost is being subsidized by investor capital, but it is still subsidized. |
|
| |
| ▲ | FrustratedMonky 37 minutes ago | parent | prev [-] | | "Nothing is subsidized" So they are profitable? I think you are mismatching accounting terms. You can't say the 'subscriptions' are profitable without accounting for the cost of making the model that is the source of the subscription. They are heavily subsidized by the shareholders. Investing, running at a loss, with hope of some future profitability. | | |
| ▲ | gck1 23 minutes ago | parent [-] | | And yet, that is completely uninteresting to their user base. If saner factory can sell you the same tool at a fraction of the cost of a gold plated factory, your choice is going to be obvious. |
|
|
| |
| ▲ | cortesoft 34 minutes ago | parent | prev | next [-] | | I have been using both codex and Claude in my day to day, trying to not get to attached to one. I want to be able to work with any provider in case one of them does something bad. | |
| ▲ | rvshchwl an hour ago | parent | prev | next [-] | | I've found Codex to be the better subscription for OpenClaw, because the limits are indeed very generous. However, I've found more and more that Claude Routines/Scheduled agents can replace all the tasks I use OpenClaw for, so I've been slowly switching over to Claude Code. Aside from OpenClaw, I don't find a lot of value in Codex as a harness on it's own. | |
| ▲ | knuckleheads 2 hours ago | parent | prev | next [-] | | I feel like Codex made a big push to run everything on your laptop. With Claude, I get 4 cpu's, a fair amount of ram and 30gb for every one of my dumb ideas for free in the cloud containers. Codex used to be similar, but last time I tried it just kept pushing me to run it locally on my laptop, which I really did not want to do with 20 requests going at once. That's the main advantage for me at the moment. | | |
| ▲ | simjnd 2 hours ago | parent | next [-] | | What runs in cloud containers? The dev servers, builds, etc.? I tried to quickly glance at the Claude website and it doesn't mention cloud containers on their pricing page. | |
| ▲ | zhshhan an hour ago | parent | prev [-] | | "cloud containers" do you mean Claude Code on the web? Codex also has similar Codex cloud. | | |
| ▲ | knuckleheads 15 minutes ago | parent [-] | | Yes, correct, they both have the same capabilities, however it felt like codex was pushing me harder to use my local desktop in an annoying way, while claude code was happy to spin up a bunch of dev containers for me in the cloud. |
|
| |
| ▲ | supertroop an hour ago | parent | prev | next [-] | | Do you use a token service like open router or just subscribe to / unsubscribe from various models sequentially? | |
| ▲ | efromvt 30 minutes ago | parent | prev | next [-] | | I do slightly prefer 5.5 for complex work but Claude quota usage has gotten infinitely better since the dark days a few months back - has gone from being infuriating to something I pretty much don’t have to worry about with it as a daily driver. (In fact, hitting GPT weekly quotas is more annoying now). Understand if people are still scarred by the issues + poor comms around them, though. | |
| ▲ | dd8601fn 2 hours ago | parent | prev | next [-] | | I have trouble justifying gpt after that gross stuff with the war department. Though the day is coming when there’s no distinguishing, I’m sure. | | |
| ▲ | beering an hour ago | parent | next [-] | | Right now there are Anthropic engineers deployed in the NSA to help them use their cyber models. The NSA is part of the department of war. | |
| ▲ | lovich an hour ago | parent | prev [-] | | pedantically, the defense department. | | |
| ▲ | jcbrand an hour ago | parent [-] | | "War department" is the older name, not "Defense department". Also, is it really a defense department when you're starting wars of aggression every 15 years or so? | | |
| ▲ | derektank 17 minutes ago | parent [-] | | The War Department has not existed since the passage of the National Security Act of 1947 and the government department has been known as the Department of Defense under US law since the act was amended in 1949. If you have an issue with it, take it up with Congress. |
|
|
| |
| ▲ | rekttrader 10 minutes ago | parent | prev | next [-] | | Wait till you kick the tires of Qwen Coder. | |
| ▲ | shimman 2 hours ago | parent | prev [-] | | I've only ever had the $20 month claude plan but last night took the time to setup opencode + openrouter paying for deepseek + glm. Previous experience, while extremely awkward, I'd hit my limit within one or two chat replies and it'd take me like 4 limit cycles to complete my task. Now I'm able to complete an equivalent task entire task for less than $2 in two cycles (ask -> revise). I'm doing basic web development here utilizing animejs. Nothing too complicated (mostly saving time doing the scaffolding, still write the bulk of animations manually). Truly believe that American companies are going to get completely curb stomped by China due to greed, ineptitude, and violating the social contract. | | |
| ▲ | simjnd 2 hours ago | parent | next [-] | | I've switched from OpenRouter to using Deepseek directly from their platform since OpenRouter providers were pretty flaky and inconsistent. Deepseek V4 Flash is suprisingly capable and insanely cheap. It takes so much to get the session cost to get to $0.01. | | |
| ▲ | efromvt 6 minutes ago | parent [-] | | The openrouter provider flakiness with deepseek was infuriating, but I’m happy in hindsight because direct deepseek has been very pleasant. Shocked by how low spend is. |
| |
| ▲ | nozzlegear 2 hours ago | parent | prev [-] | | > and violating the social contract. I agree with you on pricing, but what do you mean by this? | | |
| ▲ | shimman an hour ago | parent [-] | | Sure, modern American corporations care more about hoarding wealth rather than helping build up US society. Once neoliberalism became the mainstay economic position of the US income inequality has skyrocketed, healthcare costs have increased, childcare is more expensive than university, housing has become both unaffordable + unobtainable. By simply existing costs have increased while life becomes unstable. Why aren't corporations doing more to help workers with childcare? Why aren't they doing more profit sharing with workers? Why aren't they encouraging unions or sectorial bargaining? Why isn't the government mandating any of this? Americans very rarely benefit when US corporations do well. That needs to change. No one benefits if Meta continues making billions in profit every quarter while society suffers from isolation, depression, suicide, and scams from their services. Americans don't benefit if health insurance companies are making massive profits while they can't afford deductibles. Our society has been setup to simply extract wealth in all facets of life. That's a sick society and it needs to change. I'm not saying China does this better, in fact China has some of the worse worker rights out of all the industrialized countries; but at least American consumers would benefit from cheaper higher quality Chinese goods. The world would likely benefit too if America got off the cold war hype train that did nothing to benefit humanity outside of those making weapon systems. | | |
| ▲ | joxdosba an hour ago | parent [-] | | > Why aren't corporations doing more to help workers with childcare? Why aren't they doing more profit sharing with workers? The AI companies sure are a brilliant example of corporations needing to do more to help their employees pay for childcare. | | |
| ▲ | idiotsecant 42 minutes ago | parent [-] | | It's more useful to everyone when you engage with the strongest part of someone's argument |
|
|
|
|
|
|
| ▲ | jrumbut 3 minutes ago | parent | prev | next [-] |
| It could be my use cases, which have always seemed to be outside the wheelhouse of these models, but I find it very hard to downgrade after accessing a more capable model. Opus 4.8 produces output in 15 minutes that is 3-4 hours of my work away from output that used to take me 40ish hours (a solid week of dedicated effort). Last year(-ish, maybe it was 18 months, I forget when the jump happened), the frontier models couldn't touch this work. The output looked like a hardworking intern on their first day. Nice formatting, decent volume of words, but no understanding. So it might work if it turns out to be a substantial leap in capability. |
|
| ▲ | KronisLV 2 minutes ago | parent | prev | next [-] |
| > it feels like they are trying to get subscribers to switch to usage-based billing I think they might be hitting a point where subsidizing the expensive models for subscriptions makes less and less sense. With Opus 4.8, last month I paid 100 USD for the Max subscription and got a token equivalent of 4.1k USD. I imagine that Fable is more expensive to run. |
|
| ▲ | 0erofootprint 2 hours ago | parent | prev | next [-] |
| For me it almost immediately blocked. I had it writing code related to message digests - and it seemed to think it was too gifted for that. Gave the security warning and switched back to 4.8. Whatever... it will probably soon have the API error soon. I have mostly switched to the Codex 200 a month plan. I've found their 5.5 xhigh to be better than Opus 4.8 "ultracode." Also, i have not once seen their servers fail for compute unavailability, unlike Anthropric which happens almost ever hour. |
| |
| ▲ | matheusmoreira 17 minutes ago | parent | next [-] | | I just asked Fable for a complete code review of my lone lisp project. Started out strong. Launched Fable agents, then spent like 10 minutes thinking... And then got interrupted by a switch to Opus 4.8. > Fable 5's safety measures flagged this message for cybersecurity or biology topics. > They may flag safe, normal content as well. > These measures let us bring you Mythos-level capability in other areas sooner, and we're working to refine them. Here are the results of the agentic code review session: ┌──────────────────────────┬───────────────┬────────────────┐
│ Agent │ Fable 5 turns │ Opus 4.8 turns │
├──────────────────────────┼───────────────┼────────────────┤
│ values │ 134 │ 0 │
├──────────────────────────┼───────────────┼────────────────┤
│ data-intrinsics │ 104 │ 0 │
├──────────────────────────┼───────────────┼────────────────┤
│ tools-tests-build │ 81 │ 0 │
├──────────────────────────┼───────────────┼────────────────┤
│ core-intrinsics (failed) │ 25 │ 0 │
├──────────────────────────┼───────────────┼────────────────┤
│ system-memory │ 44 │ 20 │
├──────────────────────────┼───────────────┼────────────────┤
│ reader-modules │ 104 │ 25 │
├──────────────────────────┼───────────────┼────────────────┤
│ linux-startup │ 95 │ 15 │
└──────────────────────────┴───────────────┴────────────────┘
This cost me 16% of my weekly usage. A simple code review of the most critical areas of my project got flagged as a cybersecurity risk. It really made me not want to try it again. | |
| ▲ | kkoncevicius 2 hours ago | parent | prev [-] | | I had a similar experience. I wanted to test it by asking it to summarise a scientific OMICs-related paper. It gave a warning about me potentially developing a bio-weapon or something like that. And switched back to Opus 4.8. |
|
|
| ▲ | joshstrange an hour ago | parent | prev | next [-] |
| I would not use this if you are on a subscription. In <8min it burned my entire 5hr window (which has just reset it appears, I have over 4 hours till it resets) I hadn't used CC at all today aside from this) and then it used up ~$15 more in usage before I could stop it. I am on the $100 Max plan. |
| |
| ▲ | cortesoft 27 minutes ago | parent | next [-] | | The CLI when you select it says it has 2x the usage as opus. Not sure if that matches what you are seeing. I do wonder if you switched models mid-session, you would have lost all your cache. Reloading the context into cache can really eat through your usage. | |
| ▲ | fastball 44 minutes ago | parent | prev | next [-] | | What is your effort level? | |
| ▲ | enraged_camel an hour ago | parent | prev | next [-] | | That’s odd, I used it on a pretty complex refactoring task and it worked for 22 mins and used only 15% of my 5-hour limit. I’m on the $200 Max plan though. | | | |
| ▲ | ZunarJ5 29 minutes ago | parent | prev [-] | | They didn't even reset credits for this lol |
|
|
| ▲ | smith7018 2 hours ago | parent | prev | next [-] |
| Fwiw it's not available on my enterprise account: "Disable zero data retention to unlock Fable 5 access" |
| |
| ▲ | stronglikedan an hour ago | parent | next [-] | | We just blocked it at our org for this reason. They will "retain agent request and output data associated with this model, regardless of you Cursor Privacy Mode setting." | |
| ▲ | sdellis an hour ago | parent | prev [-] | | What does "zero data retention" mean? What kind of data does it need to unlock? | | |
| ▲ | drakythe an hour ago | parent [-] | | The announcement details it. They're storing 30 days of data on all surfaces, first and third party. They claim it is for security purposes so they can review and check for long term jailbreak and distillation efforts. They also, FWIW, say that they've instituted new policies on their end such as logging any human access to the stored data and automated deletion after 30 days in "most" cases (with another link to a document detailing that further). |
|
|
|
| ▲ | kyledrake 2 hours ago | parent | prev | next [-] |
| Considering their apparent nerfing of the end user plans in favor of enterprise clients, is Anthropic still the "more ethical AI company" like everybody loves to tell me all the time? Assuming this isn't just a supply issue on their side, nothing says "ethical AI" like only allowing mega corporations to use it through cost barriers. |
| |
| ▲ | estearum 2 hours ago | parent | next [-] | | You really misunderstand what AI-doom people are worried about if you think this is anywhere near the top (or middle, or bottom) of the list of concerns. | | |
| ▲ | Jackson__ 2 hours ago | parent | next [-] | | If you can't trust them to act ethically on the small scale, why would you expect that to turn around once it gets to a larger much more important scale? How many government sanctioned school bombings does it take for them to quit working with said government? For now we know that number is somewhere between infinity and 1. | | |
| ▲ | estearum 2 hours ago | parent [-] | | It literally does not register as "unethical" at any scale to have different products or prices for different customer tiers. The question of collaboration with USG is a much more complex one, but is not the one raised above. Edit: I'll also add that I doubt any AI-doom people "trust" Anthropic per se. The entire angle of questioning – again – misunderstands the AI-doom argument. You appear to think that if companies behave unethically, they cannot be trusted and they will not produce good outcomes, inversely: if they behave ethically, they can be trusted, and they will produce good outcomes. Any competent AI-doomer would argue that ethics or trust are essentially irrelevant. The entire problem is that people can act totally reasonably, even ethically, and this is not a guarantee of good outcomes. Situations can be created in which completely ethical, reasonable behavior actually produces a bad outcome. You do not need to assume people are bad in order to produce a bad outcome, and inversely you cannot assume that you will get a good outcome from good people. "Arms races" are one class of situations that often have this characteristic. "Bureaucracy" is another class that we encounter a lot in daily life. There's a lot of them! |
| |
| ▲ | throwaway894345 2 hours ago | parent | prev [-] | | Yeah, it's positively precious to think the specific pricing strategy for consumers is the overriding ethical concern with OpenAI, etc. I don't have any particularly strong affinity to any AI company, but comparing pricing to say mass surveillance is ... something else. | | |
| ▲ | kyledrake 2 hours ago | parent [-] | | Your beautiful straw man is negated by the fact that Anthropic seems quite eager to get back on the DoD gravy train https://www.reuters.com/business/aerospace-defense/blacklist... | | |
| ▲ | jnovek an hour ago | parent | next [-] | | Your original comment was about pricing ethics, does Anthropic’s connection to the DoD have anything to do with pricing ethics? They’re in no way coupled, one can be ethical while the other is not. | | |
| ▲ | andriy_koval 27 minutes ago | parent [-] | | even for Pentagon thing, Dario said he doesn't object military AI, but said Claude is not ready YET. I speculate he was afraid of reputational damage from cases if Claude would guide missiles on elementary schools. |
| |
| ▲ | ygjb 2 hours ago | parent | prev | next [-] | | Setting aside the simple fact that there is no ethical consumption under capitalism, the reality is that regardless of how Anthropic feels, it is becoming clear that many, if not all countries regard AI developments as strategic technologies (and they should). Anthropic needs to be at least somewhat in the good graces of a capricious administration that is already under pressure from businesses and citizens to regulate AI companies across multiple different domains, whether it's energy consumption, job displacement, military and defense applications, surveillance, etc. If Anthropic wants to survive, they need to acquire influence with the government that most impacts them as an American company, and a massive exporter of services in the AI space to other countries, otherwise they could get locked down and locked out of the market for national security reasons. It sucks, but sometimes the survival choice is to make an ethical compromise in hopes that you can still be around to make better decisions later. | | |
| ▲ | ericmay 2 hours ago | parent [-] | | > Setting aside the simple fact that there is no ethical consumption under capitalism This "simple" fact needs quite a bit of additional context and work. Making grandiose ethical claims like this can be countered with other grandiose claims such as the fact that there is no ethical existence under communism or socialism. | | |
| ▲ | cleaning 19 minutes ago | parent | next [-] | | It only needs additional context and work if you are unfamiliar with the concepts underlying it. Possibly consider you are out of your depth here, rather than jumping to conclusions. | |
| ▲ | ygjb an hour ago | parent | prev [-] | | Sure. Why not, I'm bored today and waiting for some stuff to finish up :D The fact that there is no ethical consumption under capitalism is not material to whether or not ethical existence is possible under communism or socialism. In order to survive in a capitalist society, one inherently has to make choices that require trade-offs, and those trade-offs are burdened by a history of decisions made not just by the people alive today, but our ancestors as well. Does that mean I walk around chanting "Reparations", "Land-back", or other calls to action? No, but I do acknowledge that there are unresolved issues and as a Canadian, I know we need to do more to resolve treaty issues, and environmental issues, and system discrimination. I also know that Americans need to do better to address systemic discrimination and many, many other issues. It also doesn't mean I want to give back my house, or give away all of my possessions. It just means I try to make good choices and support businesses and people that are open about the trade-offs they make and try to engage as ethically as possible. Acknowledging those facts doesn't absolve us of responsibility, it's a framework that allows folks concerned about whether or not they are doing the right thing to accept the trade-offs that they choose to make and be responsible and accountable for those choices to themselves or their communities. We live in a world with scarce resources. It's possible that with a foundational redesign of the global economy, and the requisite authoritarian government that would be required to force such a redesign, we could eliminate food scarcity, solve energy scarcity, and make sure that everyone has a place to live. Those trade-offs are probably not worth the ethical cost in political and physical violence required to accomplish it. We have seen the trade-offs that happen when the powerful are able to exploit communist or socialist governments. We are seeing the "late stage capitalism" impacts of allowing the powerful to exploit capitalism in democratic societies. Acknowledging that the current capitalist system has lead to the greatest prosperity for the upper echelon (financially) of humanity, and a dramatic reduction in global poverty shouldn't obscure the reality that much of that wealth comes from exploitation of people and the environment. It's a huge problem to unwind, and we can't let the burden of every choice that we make stop us from trying to do better, but we (as in society in general) can't do better if we don't at least acknowledge the compromises we are making along the way, and try to plan to fix it in the future. Probably a topic better suited to beer and a pub setting than HN though :P |
|
| |
| ▲ | estearum 2 hours ago | parent | prev [-] | | Where is your evidence that this is Anthropic backtracking on its ethical and contractual commitments rather than DOD backtracking on its blatantly illegal coercion (which it's almost certainly going to be successfully sued for)? Talk about a strawman! | | |
| ▲ | kyledrake 2 hours ago | parent [-] | | As someone that was in Minneapolis during the ICE raids, including one where a US citizen at a nearby restaurant was thrown in prison for 3 days despite having his passport on hand because he looked asian, it's hard for me to not equivocate the ethics of AI companies actively collaborating with the Trump administration as different flavors of ice cream. | | |
| ▲ | estearum 2 hours ago | parent [-] | | Are the two analytical frameworks available to you just "black and white thinking" or "it's different flavors of ice cream?" | | |
|
|
|
|
| |
| ▲ | DonsDiscountGas 2 hours ago | parent | prev | next [-] | | I don't think offering a product under a certain set of terms obligates a company to maintain that offering forever. The bait and switch is certainly annoying but seeing as they're very upfront about it you can't say you weren't warned. Don't like it? Don't use it. | |
| ▲ | eli 28 minutes ago | parent | prev | next [-] | | It's unethical to price it in a way not everyone can afford? | |
| ▲ | wongarsu 2 hours ago | parent | prev | next [-] | | I wouldn't call Anthropic ethical. But between Anthropic and OpenAI, Anthropic is the more ethical one | |
| ▲ | brianmcnulty 2 hours ago | parent | prev | next [-] | | Why would you have ethics when you could get that IPO money instead? | |
| ▲ | 2 hours ago | parent | prev | next [-] | | [deleted] | |
| ▲ | xvector 2 hours ago | parent | prev | next [-] | | Yup - who cares about x-risk or red lines for domestic mass surveillance anyways? I draw my red lines at prioritizing profitable customers when heavily resource constrained. That's the true definition of evilness! | |
| ▲ | Maken 2 hours ago | parent | prev | next [-] | | The bar is just too low. | |
| ▲ | fridder 2 hours ago | parent | prev [-] | | More ethical in some areas, actively user hostile in others |
|
|
| ▲ | nickandbro 2 hours ago | parent | prev | next [-] |
| Get them addicted then cut them off. Oldest trick in the book. |
| |
| ▲ | toomuchtodo 2 hours ago | parent [-] | | More of a free trial to those authenticated and qualified with existing payment. Subscription billing is going away for sure though eventually based on the economics. Token “all you can eat” is a capital furnace otherwise. (I’m highly confident open models will eventually achieve a similar performance benchmark with distillation over time) | | |
| ▲ | CuriouslyC 2 hours ago | parent [-] | | Subs lose money on individuals to get those individuals to force their companies to pay for the corporate plan. The economics are bad, but so are the economics of grocery stores selling Milk and Bananas at a loss to drive traffic, which they basically ALL do. | | |
| ▲ | eptcyka an hour ago | parent | next [-] | | I pay a lot but barely use it except for some intense days, where the lower plans would have throttled me in like 30 minutes. API billing is still more expensive. If you want to not pay much, go to openrouter and use chinese models. They are cost efficient. | |
| ▲ | HDThoreaun 2 hours ago | parent | prev | next [-] | | I havent seen any evidence showing that subscriptions cost the labs money. | |
| ▲ | toomuchtodo 2 hours ago | parent | prev [-] | | Companies don’t want to pay when the value realized does not exceed the cost. AI Savings Misses 'Should Be Making Executives Uncomfortable,' Bain Says - https://news.ycombinator.com/item?id=48359010 - June 2026 (0 comments) AI sticker shock hits corporate America- https://news.ycombinator.com/item?id=48307098 - May 2026 (146 comments) | | |
| ▲ | CuriouslyC 2 hours ago | parent [-] | | What's the realized value of not losing your engineers because you're letting them use their preferred tools? | | |
| ▲ | 2 hours ago | parent | next [-] | | [deleted] | |
| ▲ | toomuchtodo 2 hours ago | parent | prev [-] | | Retain and hire the engineers who don’t require heavy use of AI to deliver value? The current SWE job market speaks for itself. Where will you go where they will let you burn up tokens in a high cost of capital macro? ZIRP (zero interest rate policy) is over, software engineers no longer call the shots now that there isn’t vast amounts of capital chasing yield, and that capital bidding up salaries and keeping the labor market for engineers tight. If you are x more productive with generative AI, very shortly you are going to have to prove it with a token budget (or, if you’re lucky, an org willing to spend for on prem hardware for capped token cost, fixed capex vs uncapped opex). The comparison is not SWE vs SWE with AI. It is SWE vs SWE with AI with a constrained token budget ($x/month) delivering the same value at the same or lower cost. If you cannot prove that you are wildly (vs marginally) more productive with the AI, why would they pay for it? Prove it. |
|
|
|
|
|
|
| ▲ | alvis 2 hours ago | parent | prev | next [-] |
| It’s too obvious that antropic need to find way to earn enough revenue before IPO. Claude subscription isn’t earning earning much money I bet |
| |
| ▲ | sigmoid10 2 hours ago | parent | next [-] | | I think they are just prioritizing enterprise customers, because this is were historically they made most money. | | |
| ▲ | dylandevelops an hour ago | parent [-] | | I agree with you here. Unfortunately, this tends to be the case, with smaller developers paying the price. |
| |
| ▲ | sdellis an hour ago | parent | prev | next [-] | | That's a big problem for all of the AI companies. Most people don't find the technology compelling, accurate, or ethical enough to pay for a subscription. Why wouldn't Anthropic just wait until people start subscribing, do some kind of marketing push, or obtain some kind of other sustainable revenue stream, before they go IPO? I wonder if they see the writing on the wall with all of this and want to cash out as quickly as possible? | |
| ▲ | AtlasBarfed an hour ago | parent | prev | next [-] | | That's not how it works. They don't need revenue, they need addicts. Specifically they need businesses that fired people and adapted their business to the products, so when the unsubsidized costs hit the businesses are forced to eat the true costs. Yes they can't afford to give the products for free, but what is essentially happening with AI services is economic dumping, keep costs artificially low to get people to fire everybody, and then Jack the rates once they have Monopoly control | | |
| ▲ | sdellis an hour ago | parent [-] | | But the only companies firing people (and certainly not everybody) are either the companies with an AI or the investment and finance firms that stand to profit from AI. I smell hype. And no company is firing everybody because of A.I. I agree. They need addicts, but they are high on their own supply and everyone else can see the danger in getting hooked. |
| |
| ▲ | 2 hours ago | parent | prev [-] | | [deleted] |
|
|
| ▲ | ltrg 18 minutes ago | parent | prev | next [-] |
| Fable seems very good at finding bugs (unsurprising given Mythos lineage), so this seems a pretty smart strategy. Once you see the bugs it finds in your existing Opus code, it's going to be hard to go back, psychologically speaking. |
|
| ▲ | xpct 2 hours ago | parent | prev | next [-] |
| I agree, this looks like their plan to wane out subscriptions. This will probably come with Opus nerfs later. |
| |
| ▲ | rapind 2 hours ago | parent | next [-] | | I just assume Opus is constantly nerfed based on capacity. I was exclusively Claude for a long time, but the inconsistency in quality, constant outages, and slow downs were too hard to work with. I just use dumb and fast models now. I'm more engaged. I think that the higher the quality of the model, the more you tend to vibe with it, and then the more hallucinations you then miss. I'm not sure which is more productive, but I definitely burn out faster the more I vibe. At some point you're spending your time on forums, discord, or youtube instead of engaged with what you're building. Or you yak shave about your tooling and end up creating the 600th multi-agent gastown harness and blowing thousands of dollars on tokens to create it only to discover it's too expense to actually use. | | |
| ▲ | dylandevelops an hour ago | parent | next [-] | | I agree with you. The more I vibe code, the less interested I feel in what I'm building. Working with models that force me to think, especially with personal projects, helps me stay engaged and enjoy what I am doing more. | |
| ▲ | winter_blue 2 hours ago | parent | prev | next [-] | | Composer 2.5 Fast that Cursor is giving away for very little has been amazing. | | | |
| ▲ | aplomb1026 2 hours ago | parent | prev [-] | | [flagged] |
| |
| ▲ | nonethewiser 2 hours ago | parent | prev | next [-] | | It's possible that they will transition to usage credits but why not take them at their word? To date they have continued to offer better and better models to their subscription plans. | | |
| ▲ | timcobb 2 hours ago | parent [-] | | What's their word? Have they commented? Upd: I meant big picture, not with respect to this model release. Where do subscriptions figure into their strategic vision. Will consumers end up paying enterprise prices in the future? | | |
| ▲ | KyleJune 2 hours ago | parent | next [-] | | In the blog post they say when sufficient capacity allows them to do so they aim to restore Fable 5 as a standart part of subscription plans and intend to do so as quickly as they can. | |
| ▲ | dbbk 2 hours ago | parent | prev | next [-] | | Read it again | | |
| ▲ | timcobb 2 hours ago | parent [-] | | I did, I'm not seeing anything about the future of subscriptions at Athropic. |
| |
| ▲ | ls612 2 hours ago | parent | prev [-] | | In TFA they say they intend to restore Fable 5 to subscription plans some time after June 22. That is what "take them at their word" means. |
|
| |
| ▲ | taormina 2 hours ago | parent | prev | next [-] | | Those already landed! Oh, you weren't talking about 4.8? | | |
| ▲ | piva00 2 hours ago | parent [-] | | Even Opus 4.7 felt like a regression from 4.6, consumed a lot more tokens while I didn't experience any substantial improvements. The company I work at simply rolled back to 4.6 on everyone's configurations, disabling the toggle for 4.7. | | |
| ▲ | taormina 2 hours ago | parent [-] | | 4.6 has been my happy place for getting anything done for a while now. |
|
| |
| ▲ | xvector 2 hours ago | parent | prev [-] | | HN needs to take a chill pill. Could it be that Mythos is expensive and they just want to give people a taste of it? I mean the alternative is not offering it at all? | | |
| ▲ | 8note 2 hours ago | parent [-] | | its unclear how they can offer it broadly but only for half a month. why do they have capacity now that they wont in a few weeks? | | |
|
|
|
| ▲ | irthomasthomas an hour ago | parent | prev | next [-] |
| This is just the sales team doing their thing, applying the Law of Scarcity to drive demand. It's the same exact speed as opus >=4.5, sonnet 4.5, and twice the speed of opus <=4.1 It must have about the same active parameters, or else its a larger model running in turbo mode (smaller batches) and being heavily subsidized for some reason. But given most of the benchmarks are within 5% I doubt it is a much larger model. Most perplexing. |
|
| ▲ | timcobb 2 hours ago | parent | prev | next [-] |
| Ooof so are we thinking that in the next 6-12 months subscriptions will be replaced with paying retail like enterprise currently? |
| |
| ▲ | aseipp 2 hours ago | parent | next [-] | | They almost certainly already make a fuckload more money off API pricing than they do subscriptions, even if there might be more total subscription users. So offering subscriptions even at some loss is probably going to continue. Honestly, I'd be surprised if they even lost money on most subs; there are definitely Token Whales out there who mess up all the accounting up, though. Realistically I think Anthropic just has insane demand but finite capacity to run models, and Fable will just make them more money if they dedicate it to API pricing. I suspect the goal here is something like: get individual engineers/PMs on their personal plans to taste Fable and then go to their meetings and say "Yes doubling the price of every single input/output token is a good idea, boss". | | |
| ▲ | timcobb an hour ago | parent [-] | | But I don't want to be the developer who goes and says we must pay all this money for these tokens. I don't know who wants to be that developer. | | |
| |
| ▲ | CuriouslyC 2 hours ago | parent | prev | next [-] | | I don't think they'll phase out subscriptions ever, their whole play has been to drive demand from the bottom up. Get engineers hooked on building with claude at home, then get them to demand the ability to use it at work, and bend over their employer with no lube. They'll probably tighten the quotas to reign in whales though. | |
| ▲ | thewebguyd 2 hours ago | parent | prev | next [-] | | I certainly hope not. PAYG is not predictable enough for smaller companies or individuals. Where I work (non-tech company), PAYG would never fly. We aren't big enough for that. Of course, you can set usage budgets, but there's a pretty big difference between $200/user/month vs. the equivalent PAYG usage being closer to $1,000/user/month, if you currently use the subscription plan to its limits each week. Going PAYG only will effectively take these tools away from a huge amount of people and accelerate the push for local LLMs. OTOH, accelerating the push for local LLMs would also be fine with me. | |
| ▲ | ygjb 2 hours ago | parent | prev [-] | | I doubt it, given the importance of those subscriptions for building and maintaining market awareness. The AI landscape is changing rapidly, and with Apple announcing the option to change the AI backend, and potential requirements enable AI choices as well, similar to EU browser choice requirements (this is more reading tea leaves than any actual requirements I am aware of). The new OS changes coming to support Googlebook, and deep Copilot/AI integration into Windows will make maintaining user facing subscriptions essential for independent model developers like OpenAI, Anthropic, and Mistal to remain relevant longer term. If the don't maintain that relevance there is increasing likelihood that they will get consumed by other companies whether it's Apple, Microsoft or Google to form a foundation for their OS, or other cloud providers. | | |
| ▲ | timcobb 2 hours ago | parent [-] | | That make sense, but what about the specific bifurcation we're seeing here of super primo models versus still good models being available to subscriptions? It's kind of annoying not getting access to the primo model and paying 200 bucks a month. I understand 200 bucks a month is basically nothing though. Like I don't totally understand why they'd let me have it for a couple weeks and then take it away and say I can have it but I have to pay retail and retail is like $1,000 a day. It's better to have loved and lost than to have never loved at all?? | | |
| ▲ | ygjb an hour ago | parent [-] | | It's a trade-off. Every hyperscaler is buying and building compute capacity as fast as they can dodge red tape. There is limited compute capacity, and scarcity is a real thing. As a consumer I can choose to buy subscriptions to a range of things, including $5 droplets or VMs on a broad range of cloud hosting providers. I can even buy cheap bare metal at a bunch of providers at an affordable retail rate. I can also buy "unlimited" AI packages that will be optimized to fit the cost model from a variety of services, with different impacts, such as rolling outages when I consume a daily or hourly allotment. Right now VC and the investor class are subsidizing the rapid evolution of the services and availability, but that VC is running out. In more traditional economies, AI would have developed and rolled out more slowly, and through metered subscriptions, with the eventual rolling out of "unlimited" packages like telephone, internet, or cell services once the market became commoditized. We have seen a big inversion of that with the race to "win" AI marketshare. Now the true cost is being exposed, and the most competitive and capable models are hideously expensive to operate, so it makes sense that we are moving to metered billing for a utility service. If you want gas, you can buy regular or premium. If you have a premium car you definitely want the premium, but for most people regular is good. Give it a couple of years, and the survivors will settle around fairly industry standard models of consumer grade services, pro-sumer accounts, and business/enterprise models. Things are still shaking out, but I get the sadness. Luckily I work at a big tech company who is banging the drum on doing experimentation so I use my prosumer claude pro and other accounts at home for hobby stuff, and save my heavy lifting and potentially experimentation for work :P |
|
|
|
|
| ▲ | dack an hour ago | parent | prev | next [-] |
| i doubt that's the goal for them. i bet they just really don't have capacity for people using it a ton, yet they wanted people to be able to try it out while it's new. so they compromised and made it temporarily available. and then hope they can get costs down or capacity up so they can make it more available again |
| |
| ▲ | InsideOutSanta an hour ago | parent [-] | | I think the goal is "private citizens: subscriptions; corporations: per-token billing." It's getting people addicted to LLMs on cheap subscriptions so that they can then force companies to pay for expensive inference. |
|
|
| ▲ | irthomasthomas 20 minutes ago | parent | prev | next [-] |
| "we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distributed training infrastructure, or ML accelerator design). ... Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user." |
| |
| ▲ | altcognito 12 minutes ago | parent [-] | | Where is this text coming from? [edit] -- I see that this comes from the system card -- dang merged the comments from the other discussion so that explains the confusion. |
|
|
| ▲ | daft_pink 2 hours ago | parent | prev | next [-] |
| I’m just about ready to cancel my small business 5 user plan with max licenses, because although cowork is really great. I just find OpenAI/Codex to be a lot better most of the time. |
|
| ▲ | nicce 2 hours ago | parent | prev | next [-] |
| > The "offer, then remove" aspect is a bit eyebrow-raising -- it feels like they are trying to get subscribers to switch to usage-based billing, which makes me wonder if we'll ever get it after that June 22nd window. Probably all about the IPO. |
|
| ▲ | matheusmoreira 2 hours ago | parent | prev | next [-] |
| This is really sad... I really didn't want to be priced out of these models but it looks like that's going to happen sooner rather than later. |
| |
|
| ▲ | Aleleo76 2 hours ago | parent | prev | next [-] |
| Pay-as-you-go billing is a kind of drug, I use it every now and then when I'm working on a project with Opus, in a moment you spend a fortune |
|
| ▲ | lisperforlife an hour ago | parent | prev | next [-] |
| My guess is that it is a massive model similar to GPT 4.5 and $10/$50 pricing is for its output will discourage people from using it. I also read safety = nerfed. |
|
| ▲ | ABS 2 hours ago | parent | prev | next [-] |
| also: Fable takes 2× the usage of Opus |
|
| ▲ | dirkc an hour ago | parent | prev | next [-] |
| This serves as a good reminder that relying on AI models is borrowing your tech from someone else. They can take it away or raise the prices arbitrarily. If you rely on this as a core part of your business/profession, you will be at their mercy and subject to whatever whims or challenges they have. |
|
| ▲ | DonsDiscountGas 2 hours ago | parent | prev | next [-] |
| I expect that depends on demand, feedback, and whether GPT-6.0 gets released and is competitive |
|
| ▲ | oersted 2 hours ago | parent | prev | next [-] |
| > Pricing for both models is $10 per million input tokens and $50 per million output tokens. The step-up in intelligence looks massive (we'll see in practice), but the price is getting to a point where it's making me question if it's even worth giving it a try. Good competitors will probably be out soon, which should level the playing field. I am more excited about that, just the fact that they showed that such an improvement is possible. I'm okay waiting a bit longer for this to become attainable for plebs like me. |
| |
| ▲ | xyzsparetimexyz 2 hours ago | parent | next [-] | | This is probably the end of 'use the best model no matter the price' | |
| ▲ | kolinko 2 hours ago | parent | prev | next [-] | | The pricing can be a bit deceptive though. A good model can deliver the same results in fewer tokens. Kind of like billing a programmer by the hour. | | |
| ▲ | zyuiop 14 minutes ago | parent [-] | | Sadly this does not seem to be the case here: if you read the announcement entirely, they include a "cost per task" metric which basically continues the trend of their previous models. So yes, tasks will cost you more, but results will be better - allegedly. |
| |
| ▲ | sourcecodeplz 2 hours ago | parent | prev [-] | | Why wouldn't it be? How much would you pay a scientist at this point to think about a problem for you and give you a solution? |
|
|
| ▲ | clementg 2 hours ago | parent | prev | next [-] |
| I really don't want this to start being the norm |
| |
| ▲ | baggachipz 2 hours ago | parent [-] | | I don't see how it won't be. They lose insane amounts of money on subscription plans. I'm sure they still lose money on usage-based billing, but probably not as much. | | |
| ▲ | JumpCrisscross 2 hours ago | parent | next [-] | | > They lose insane amounts of money on subscription plans Do we know this? I’ve seen evidence they lose money on heavy users. But so do gyms. | | |
| ▲ | saaaaaam 2 hours ago | parent | next [-] | | How do gyms lose money on heavy users? A heavy gym user isn’t really costing the gym anything extra as far as I can see. | | |
| ▲ | JumpCrisscross 2 hours ago | parent [-] | | > How do gyms lose money on heavy users? Most gyms sell more subscriptions than they can fit under their roof at one time. If a gym only sells to heavy users, it will either be constantly turning members away or have to buy more equipment. Its equipment will wear off faster. Depending on amenities, it will go through towels, soap, water, et cetera faster, too. | | |
| ▲ | tripleee 2 hours ago | parent [-] | | Gym equipment lasts 10+ years in a commercial gym, at $50/mo that's a minimum of $6k paid from a single person. Unless they're really, seriously wasteful with the soap.. there's no chance a gym is losing money on a heavy user | | |
| ▲ | rafram an hour ago | parent [-] | | It depends on the gym and their business model! A super-budget gym like Planet Fitness that charges $15/month is going to lose money on heavy users, but they count on most of their members being infrequent gym-goers. A luxury gym like Equinox that charges $300/month can target heavy users without any issues, and they'd actually rather members go more so they stay and spend money on expensive salads and smoothies. Right now all these AI subscriptions are priced like Planet Fitness, but they're used like Equinox. They're hoping that the new a la carte offerings will move their pricing more in that direction as well. |
|
|
| |
| ▲ | charcircuit an hour ago | parent | prev [-] | | >I’ve seen evidence they lose money on heavy users. Where? | | |
| ▲ | JumpCrisscross 36 minutes ago | parent [-] | | There are tons of blog posts where folks work out the API cost of their usage and find it well above subscription cost. | | |
| ▲ | otterley 24 minutes ago | parent [-] | | That doesn't mean the company is losing money in aggregate on these subscriptions. Buffets are still in business even though some people gorge themselves silly at them. The incremental cost may exceed the incremental for a particular person or minority group, but that's not how these businesses measure profitability. |
|
|
| |
| ▲ | cautiouscat 2 hours ago | parent | prev [-] | | I assume consumers aren’t a big note in their bottom line. I’m not actually very sure about that, just an assumption. What I wonder however is if these tools will become something I use at work only. $100/month is already a massive stretch budget wise. If these models keep devouring tokens there’s no way I’d get the same usage time out of them for $100 in usage credits. I just don’t think I’d use them much at all at home. |
|
|
|
| ▲ | a-dub an hour ago | parent | prev | next [-] |
| the claimed inference cost is 2x. if that is true, it is massive and remarkable that they're able to do anything like this at all. |
|
| ▲ | meowface 2 hours ago | parent | prev | next [-] |
| It's very disappointing but I'm assuming it's for rational reasons on their part. |
|
| ▲ | deanc an hour ago | parent | prev | next [-] |
| But it's not and it's highly disingenuous to frame it like this. Quote directly from Claude code, moments ago: > Fable 5 · Most capable for your hardest and longest-running tasks · Uses your limits ~2× faster than Opus |
|
| ▲ | systemvoltage an hour ago | parent | prev | next [-] |
| It's interesting that we are seeing a time when subscriptions are not preferred and usage-based billing is. Pay-as-you go isn't a common thing in SaaS. For example, except for AWS SES, all email providers are bulk-subscription based. |
|
| ▲ | nutjob2 2 hours ago | parent | prev | next [-] |
| > "offer, then remove" Sounds like "bait and wait". If you think about it, the more people pay for these new and more resource hungry models, the longer it takes for them to become no extra cost and the longer it takes the more people are tempted to pay extra. |
|
| ▲ | FergusArgyll 2 hours ago | parent | prev | next [-] |
| I'm about to be priced out of SOTA llms and it's an awful feeling |
| |
| ▲ | wahnfrieden an hour ago | parent [-] | | Not with Codex | | |
| ▲ | FergusArgyll 29 minutes ago | parent [-] | | But they're behind by quite a bit now. CFO (of OAI) Sarah Friar said the next training run will be in the fall on Vera Rubin, I think that means I'll have to wait > 6 months?! |
|
|
|
| ▲ | rvz 2 hours ago | parent | prev | next [-] |
| > * On June 23, we’ll remove Fable 5 from those plans. Using it after that will require usage credits. If capacity allows, we’ll extend the included window. Of course, they are a casino as well giving you free spins at the wheel with their new Fable machine, and it is done on purpose. Once there freebies have expired, many of its users will begin to gamble more on the new casino machine and will realize that it is expensive. |
| |
| ▲ | xvector 2 hours ago | parent [-] | | If it's that big of a problem to you, you're free to just... not use the freebie? | | |
| ▲ | cautiouscat 2 hours ago | parent | next [-] | | It’s an interesting thing to bring up because it’s this classic thing we’ve seen for decades now. The ramifications go beyond the individual which is why I assume they mentioned it. They don’t need to use it/not use it for it to have interesting implications. | | |
| ▲ | xvector 2 hours ago | parent [-] | | so it'd be preferable if they didn't include the model at all? | | |
| ▲ | cautiouscat 2 hours ago | parent [-] | | I didn’t say that and I don’t have a feeling on that either way. But this is a limited time trial and calling it out as such is valid. Is it nice we get the trial? Sure. Is it also a common play in the playbook of tech companies? Yes. |
|
| |
| ▲ | danslo 2 hours ago | parent | prev [-] | | It's not a freebie, it still requires a subscription and burns tokens twice as fast as Opus. |
|
|
|
| ▲ | aray07 2 hours ago | parent | prev | next [-] |
| i have never seen this before - where you offer something and then take that away |
| |
|
| ▲ | firemelt 2 hours ago | parent | prev | next [-] |
| damn they are drugs dealer |
|
| ▲ | 2 hours ago | parent | prev | next [-] |
| [deleted] |
|
| ▲ | OOTW an hour ago | parent | prev | next [-] |
| [dead] |
|
| ▲ | AAYALAG 32 minutes ago | parent | prev [-] |
| [dead] |