Remix.run Logo
botw44 7 hours ago

The whole thesis falls apart though. You can't be on your way to "power over everything" and get distilled into free Chinese models within months. Pick one.

The bottleneck is compute and data, not the model. That's why they could only gate it for a bit. The ITAR thing proves it: no nationality controls in place, so the only option was killing the whole thing. Not exactly what an all-powerful gatekeeper does.

embedding-shape 6 hours ago | parent | next [-]

> The whole thesis falls apart though. You can't be on your way to "power over everything" and get distilled into free Chinese models within months. Pick one.

But is that last part actually true though? Sure, there might be 600B+ models available for download and local inference if you have the hardware, but does the users who use Anthropic switch over to those even if they're available even as hosted models? Seems like some do, most don't, Anthropic and Claude remains very popular among the people who use LLMs, there is no denying that.

vbezhenar 6 hours ago | parent | next [-]

> does the users who use Anthropic switch over to those even if they're available even as hosted models?

I'm currently spending $200 for Claude. That's around my maximum that I can afford. I could stretch that to $500 I guess. But I saw reports of people spending tens of thousands of dollars with Claude API. That's certainly outside of my budget.

So if/when Anthropic decides to stop subsidizing subscription (if they ever do that thing, I still not sure about that), I'll certainly look at the other options. And available "open weights" LLMs hosted by someone will be my first pick. Right now Claude 4.8 feels very advanced, but things move very fast...

5 hours ago | parent | next [-]
[deleted]
HDThoreaun 5 hours ago | parent | prev [-]

The ai labs would be very dumb to get rid of subscriptions. First, I don’t even think the subscriptions are losing money, I suspect they’re around break even, maybe small loses. More importantly, the subscriptions are how they lock in users and convince companies to pay api rates. Without user loyalty that they cultivate with subscriptions businesses will just use the cheapest model on open router or maybe local models.

dominotw an hour ago | parent [-]

> I don’t even think the subscriptions are losing money, I suspect they’re around break even, maybe small loses

whats the basis for this thought

vineyardmike an hour ago | parent | next [-]

For Claude specifically, (1) enterprises pay API rates on top of subscriptions, so subscriptions profitability questions are only relevant for smaller companies and indie devs. Many of whom probably have sporadic or low usage which helps balance some heavy users.

Again, for Claude, (2) it’s rumored that their API rates have around a 90% profit margin. It’s also claimed that the subscription limits get you around 10x tokens per monthly dollar vs buying them with API rates.

Edit: to drive it home. If a tokens true cost to anthropic is 1/10 of what they sell it for at API rates, and a subscription gets you tokens at 1/10 the price, that’s cost-neutral for the business if every subscription uses every token. They’re selling tokens at cost, not at a loss. Many subscription users won’t use their full allotment. That means serving some users doesn’t cost the business as much - which might push the subscription business from cost neutral to profitable.

dominotw an hour ago | parent [-]

not sure how that concludes that subscriptions are not losing money.

HDThoreaun an hour ago | parent | prev [-]

I think theyre charging at least 3x marginal cost for the api and I think that the average subscription uses around half its token allotment. So subscription needs to give 6x as many tokens as the api would cost for it to break even for the lab and that's around where they tend to sit

xboxnolifes 25 minutes ago | parent | prev | next [-]

People dont pivot on a dime. If there stopped being major model improvements for a few years and equivalent free models have been out during the same period, we will see people slowly move over to competitors.

FuriouslyAdrift 5 hours ago | parent | prev | next [-]

The hotness we are seeing is smaller 'expert' models with an 'orchestrator' model in front that evaulates the prompts and routes to the appropiate small models and then synthesizes the collected answer. Easier to split across many smaller, cheaper servers and more efficient than a huge monolithic model.

losvedir 5 hours ago | parent [-]

Do you have more info about this? I can't tell if you're being misled by the unfortunate "Mixture of Experts" terminology (which don't work the way you're describing), or alluding to something different.

Or, maybe I'm wrong, but my understanding is: MoE is just an architecture to keep the activated weights smaller per token. The experts get routed basically token-by-token, and the "experts" themselves don't have a semantic domain so the "expert" word was maybe a poor choice.

everforward 3 hours ago | parent | next [-]

No, this is an agent-level thing, not a feature of the model (ish, unsure for Fable).

You talk to a smart, heavy model to build a plan composed of smaller steps. Then you have the heavy model spin up smaller, cheaper LLMs to actually implement the tasks.

The heavy model is basically read-only in that mode. It can read files, execute tests, etc, but it can’t write code. It just tracks what needs to be done, offloads the work to dumber LLMs, validates the task is done, and moves on to the next step.

It sort of pushes humans up the stack. Instead of having a human sitting there prompting the LLM to start the next task, you have another LLM do that loop.

It’s been on my list to try out.

thesz 4 hours ago | parent | prev | next [-]

https://en.wikipedia.org/wiki/Mixture_of_experts#Sparsely-ga...

"The sparsely-gated MoE layer,[21] published by researchers from Google Brain, uses feedforward networks as experts, and linear-softmax gating. Similar to the previously proposed hard MoE, they achieve sparsity by a weighted sum of only the top-k experts, instead of the weighted sum of all of them."

"Top-k experts," in case of some DeepSeek's models k=1.

bugglebeetle 3 hours ago | parent | prev [-]

See OpenRouter’s recent announcement on a model fusion setup, which they now support via API:

https://openrouter.ai/blog/announcements/fusion-beats-fronti...

halJordan 5 hours ago | parent | prev | next [-]

I don't think you're appropriately understanding the full gamut. The individuals who only spent $200/months will be stuck. But the pie is increasing in size, it's not stagnant. There are a lot of orgs who can afford to run a 1T model and even more that can run a 600B model. These newcomers are what's being fought over

ForHackernews 6 hours ago | parent | prev [-]

> Anthropic and Claude remains very popular among the people who use LLMs

Only because someone else is paying the bills. I use Claude Opus at work because my employer pays for the tokens and encourages me to do it.

At home, I use DeepSeek Flash. It's not as good, but it's maybe 0.7 quality for 0.001 cost.

LaurensBER 5 hours ago | parent | next [-]

Same, I had Deepseek search for, download and transfer (to my Linux emulation machine) the best Dreamcast games yesterday.

GPT refused to do so (citing that it's illegal even though I own the games). Deepseek did a wonderful job for 7 cents.

At work I use Opus because, why not? But I could easily switch to a less capable model if needed.

JCTheDenthog 3 hours ago | parent [-]

>citing that it's illegal even though I own the games

In the. US at least it is actually illegal to download ISOs/roms of games, even if you own a physical copy. It's a stupid law and as a downloader (as opposed to the people hosting the files) your chances of getting into any kind of actual legal trouble are effectively 0, but it is still against the law.

mark_l_watson 6 hours ago | parent | prev | next [-]

I have a question that perhaps you or someone else here has an answer for: I enjoy using Opus via Google Antigravity (usually agy) for perhaps 90 minutes a week. For Google’s subsidized $20/month plan they seem to give out a reasonably generous amount of Claude tokens. How does this compare with Anthropic’s $20/month plan using Claude Code?

BTW, I also use DeepSeek v4 Flash very frequently: fast and so cheap it is almost free.

everforward 3 hours ago | parent | next [-]

It’s really hard to translate minutes to tokens, it depends on how you’re using it.

The best answer would be to pull session stats from your harness and compare that against the limits. I think Anthropic publishes the limits of each plan.

If you’re using a pretty stock harness and not doing crazy multi-agent stuff with it, you’re probably fine.

My girlfriend built a whole (but simple) React app with it and only hit the limits of the $20 plan once. In fairness, she was trying to get it to clean up a bunch of 800ish line React files at once with a vague “make it look nice” prompt that she ran a few times. I think it was just churning for like half an hour straight before she burned all her credits.

It’s probably enough if you’re not on a fully agentic development strategy, it’s plenty to have it write tests and do comments and stuff, just not enough to continually have it doing giant refactoring passes.

trollbridge 3 hours ago | parent | prev [-]

Anthropic's plans are based on user experience of usage, not raw token counts, so you get to run through so many conversation turns, etc. within a 5 hour usage window. (Cursor, OpenCode Go, and others are similar.)

Cursor's $20 a month plan provides a reasonable amount of Opus tokens as well.

okdood64 4 hours ago | parent | prev [-]

What's the speed on DeepSeek Flash? And what provider?

ForHackernews 4 hours ago | parent [-]

Fast enough? I signed up directly with https://platform.deepseek.com/ because it was the cheapest I could find. I use both Anthropic and Deepseek models via the VS Code copilot plugin https://github.com/Vizards/deepseek-v4-for-copilot

_the_inflator 6 hours ago | parent | prev | next [-]

I disagree. It is not the model alone. It needs a system which capitalizes on it. And this is very complex. Hardware, software, architecture - it takes a lot to get it right.

Try running the latest OS models on a normal Mac or PC. Claude Fable and Mythos are systems not just pure models.

And of course marketing. Don't believe the hype.

I think Claude is often times underwhelming. Security concerns are also a concern companies have a blond spot for. The really toughest pro security (Yes, pro! Totally different framing!) company I know is Google after all.

What I can companies advise to do is, really having more than just bug bounties but a professional hacker team that does nothing else but attacking them the whole day and night 24/7. This needs to be coordinated with the government otherwise you might sound an alarm and will be SWATed for doing good. And I would pay them huge sums since the risk and fallout warrant such a treatment, not the standard wage.

Hackers are the real deal, not AI. Proof: Hackers using AI.

zozbot234 6 hours ago | parent | next [-]

> Try running the latest OS models on a normal Mac or PC.

It can be done through the magic of SSD offload. The worst case involves seconds-per-token speeds, but that's OK if you only care about low volumes of slow unattended inference, which maximizes utilization for the hardware.

(The real worst case, where you're streaming the whole model from the cheapest storage you could feasibly think of, involves multiple minutes per token for a single inference, or even hours per token batch if you're doing many inferences in bulk. That's a lot less helpful, so there's a space for smaller models at the edge, even for unattended workloads.)

nerdsniper 6 hours ago | parent | prev | next [-]

> I disagree. It is not the model alone. It needs a system which capitalizes on it. And this is very complex.

AFAICT … despite saying you “disagree”, you appear to be agreeing with the parent comment that the model is less important and compute (all that complex infra) and data (also complex infra) are more important.

trollbridge 3 hours ago | parent | prev | next [-]

An LLM which provides an OpenAI or Anthropic API-compatible interface + a coding harness like OpenCode or oh-my-pi is a pretty easy "ecosystem" to replicate. Exactly what makes you say Fable or Mythos are "systems, not just pure models"?

everforward 3 hours ago | parent [-]

Fable can delegate tasks to Opus or Sonnet, so it has some agentic properties and I believe it does them in parallel.

The parallelism is where this starts to fall apart on a local PC. Like I can run some Qwen quants, but I can’t run a decent Qwen model while also running another model smart enough to actually implement it. I’d have to do them in series, and given how long Fable seems to take even with parallelism, I’d probably be waiting days for an answer.

trollbridge 3 hours ago | parent [-]

oh-my-pi can delegate tasks to other models too. I usually use DS4 Flash for low priority subagent tasks.

If Fable is "delegating" tasks, then there's actually an agent front end of whatever you think the API is.

We have a local instance of Qwen-3.6 which is more than adequate for running agents. You can mix and match local and cloud-hosted models. (My biggest use case for local models right now is vision models because they're quite small and I can avoid some data-locality issues my customers wouldn't be comfortable with if I sen them to a Chinese model.)

ramblurr 6 hours ago | parent | prev | next [-]

> > The bottleneck is compute and data, not the model.

> I disagree. It is not the model alone. It needs a system which capitalizes on it. And this is very complex. Hardware, software, architecture - it takes a lot to get it right.

What do you disagree with exactly?

christkv 6 hours ago | parent | prev [-]

For now I suspect however that the gigantic models are not needed and you will be able to do pretty much what you need in a specific domain with 120b or lower. There is so much trash in the frontier models. I don't need all the world's slam poetry for my coding tasks for example.

ACCount37 5 hours ago | parent [-]

Wrong, mostly.

Model capability is a function of model size. Raising the bar raises model performance in every domain.

An "idiot savant" model that's overtrained for a specific domain would beat a generalist model of the same size. But scale the generalist up enough, and it'll trounce the specialist. Removing poetry data from a model training mix doesn't give you much - it might even cost you some performance - and "idiot savant" approach of overtraining for a domain has a hard ceiling.

So far, it seems like there's some equivalent of "g factor" in LLMs - a broad "intelligence" value that performance across many diverse domains correlates with. And, as a rule, larger models have more of it.

everforward 3 hours ago | parent | next [-]

While I disagree with OP about removing stuff from the model, there’s a valid question about tradeoffs between intelligence and price.

Deepseek Flash is almost certainly wrong more often than Opus or Fable. It also costs like 5% as much.

The question becomes if I run Deepseek in a loop to fix the mistakes it made that Opus/Fable didn’t, can it fix its own bugs in few enough tokens that it’s still cheaper?

So far, the answer seems to be “yes, by a significant margin”. A lot of tasks are simple enough that both Deepseek and Opus or Sonnet can one-shot it, which is a huge cost win for Deepseek. Even on the long tail, it’s usually like 4x the tokens on Deepseek which is still way cheaper than Opus.

There are things that Opus can do that Deepseek just won’t ever really nail, but it happens so infrequently that I just don’t worry. Like most people, most of what I do is the same sort of “3 tier app with a React frontend” that doesn’t take a rocket scientist to work out.

overfeed 4 hours ago | parent | prev [-]

> Wrong, mostly.

> Model capability is a function of model size

Model effectiveness has improved across model sizes. You really should try the latest flash variants more. They have become my default for most tasks except for gnarly high-level planning.

trollbridge 3 hours ago | parent | next [-]

Right - the idea that "bigger model = better" might have been true a year ago, but the flash models are extremely effective right now. You simply use them for the tasks they are ideally suited for.

ACCount37 4 hours ago | parent | prev [-]

"Capability per parameter" is rising, but parameter count remains an advantage. And small models remain bad, because "good" is a rapidly moving target.

A 2026 4B beats 2024 4B, but both are far behind the contemporary frontier. Which makes them bad. There is no such thing as "too much capability" - a "good" model is whatever the current frontier is.

In 2024, a "good" model is one that can be trusted to write a 800 line script. In 2026, it's a model that can be trusted to do gnarly high-level planning and execution both. In 2028, it's going to be something like a model you can point at an extremely involved task, abandon, and have it report back with a "done" in 3 weeks.

overfeed 2 hours ago | parent [-]

> A 2026 4B beats 2024 4B, but both are far behind the contemporary frontier.

The thing about engineering is you don't just use the biggest bolt on the market on every bridge.

> In 2024, a "good" model is one that can be trusted to write a 800 line script. In 2026, it's a model that can be trusted to do gnarly high-level planning and execution both

This sounds a lot like having a single diamond-head hammer as the only tool in your toolbox. As suggested by the name, flash models are fast - sometimes I want to write the equivalent of fifty 800-line scripts. There is such a thing as good enough.

ACCount37 2 hours ago | parent [-]

Good enough? That's a lie people tell each other because they lack imagination.

"It's good enough" was said about GPT-4, o1, o3, Opus 4 and more. Guess what happened? Newer models released, people updated their expectations of what LLMs can do, usage got more aggressive, and somehow, GPT-4 went from "good enough" to "obsolete trash".

If you have no imagination, then at least substitute your pattern recognition for it.

The world is hungry for capabilities. There are piles upon piles of tasks that aren't done by LLMs simply because LLMs aren't good enough to do them.

The thing a frontier model gives you is "you don't have to babysit a model to get it to do X", and that X gets more and more impressive release to release.

overfeed 2 hours ago | parent [-]

I wish you had addressed at least one of arguments in good faith before jumping to insults and countering a strawman argument I didn't make - I never claimed their will be no use for more capable models.

You do your AI-maximalism, and I'll stick to making trade-offs based on the needs of each piece of work.

ACCount37 44 minutes ago | parent [-]

I.e. spending your time and effort on making choices that don't matter.

I'll do more "per-task model selection" when AIs themselves get good at it.

zozbot234 6 hours ago | parent | prev | next [-]

"Distillation" from APIs is not a thing, it cannot replicate a model's deep reasoning and behavior.

bob1029 6 hours ago | parent | next [-]

I struggle with the practicality of the whole thing.

The amount of tokens required to properly distill a frontier model is so large that by the time you could consume the # of tokens you would either be banned for extremely obvious abuse or a new model would be released, rendering your efforts less and less valuable over time. Intelligence is not a linear thing. Being behind just a little bit can have exponential consequences.

Aperocky 5 hours ago | parent [-]

> Being behind just a little bit can have exponential consequences.

That seems to be the argument of Dario, Sam et. al., but I'm not ready to believe it. Time will tell, but this can be a marathon and Anthropic and OpenAI is in getting ready to sprint the last lap of the first mile.

archon 6 hours ago | parent | prev | next [-]

I'm uneducated on how distillation works at more than a basic level so forgive me if this is a stupid question.

Isn't "distillation" of another provider's model exactly how these models got training date in the first place: Massive amounts of the written word + Prompt -> Answer. Why wouldn't distillation produce similar "reasoning" in the new model? It's just inputs and outputs.

maxbond 6 hours ago | parent | next [-]

What you're describing is (pre-)training. Distillation requires richer labels, the probability distribution over tokens (it would be logits rather than probabilities but that's not important). From a chat transcript you can only understand the argmax/most likely token of that distribution (and only if the API allows you to set the temperature to 0). It's not impossible for an API to give you that but they won't if they don't want you distilling their models.

The intuition is that distillation exploits not only the "right" answer but the relationship between answers (what's the second most right answer? the third? etc).

zozbot234 6 hours ago | parent | prev [-]

Among other things, because you simply can't get those "massive amounts" of text from a SOTA model at reasonable cost. And complex reasoning cannot possibly be trained in a pure one-shot fashion, real post-training takes massive resources. The whole story doesn't add up.

saberience 6 hours ago | parent | prev [-]

This is totally inaccurate, the APIs provide the reasoning logs. You ABSOLUTELY can distill from APIs, in fact, that's the primary way distillation is done currently.

zozbot234 6 hours ago | parent [-]

Not for proprietary models, all you get is a terse summary.

6 hours ago | parent [-]
[deleted]
olmo23 7 hours ago | parent | prev | next [-]

> no nationality controls in place

Not for now, but how long before we have KYC regulations concerning LLMs?

thefounder 7 hours ago | parent | next [-]

That’s really what Dario wants. Let’s hope he doesn’t get it

baq 7 hours ago | parent | next [-]

what Dario wants is to retain any influence whatsover on how the research progresses before the inevitable nationalization of the frontier. he gets to keep the N-2 tech and maybe influence the N-1 tech, but the only influence on the frontier he has is today; whatever he imprints in the pipeline the government takes over.

IOW I don't think he thinks in the same categories as most folks here.

overfeed 4 hours ago | parent | next [-]

> ...the research progresses before the inevitable nationalization of the frontier.

Hacker News has been telling me America beats China at "innovation" because of the "freedoms" - especially frew enterprise. I wonder how a nationalized frontier lab would perform.... Andhow the non-citizen researchers would feel about working for the US government that doesn't trust them to use frontier models.

stogot 6 hours ago | parent | prev [-]

N-1? N-2?

Avicebron 6 hours ago | parent [-]

Best-possible-model (N) - Two Generations (2), same with N-1, N is the SOTA in this example. I'm not sure that actually clarifies what the comment is trying to say other than they think the models will be nationalized (can't even imagine what that would look like).

baq 6 hours ago | parent [-]

basically imagine the Manhattan project, but instead of blowing up the desert they're building the biggest datacenter you've ever seen.

Avicebron 5 hours ago | parent [-]

Isn't this the beginning of the plot of "I Have No Mouth, And I Must Scream"? The exceptionally disturbing dystopian horror?

baq 5 hours ago | parent [-]

the possible futures after the thing is built are uncountable, but hoping the thing won't get built at this point is naive.

in general I agree people should be reading a lot more sci-fi nowadays than they used to.

stogot 5 hours ago | parent [-]

I read the popular ones, but itch for more. Which sci fi most applies today?

vbezhenar 6 hours ago | parent | prev | next [-]

But he already got it, no? Claude Fable can only be made available to US citizens, which implies that every user who wants to use Claude Fable must provide proof of citizenship in some way, basically KYC.

misnome 3 hours ago | parent [-]

For everyone, not just them

dofm 7 hours ago | parent | prev [-]

Regulatory capture is the OpenAI and Anthropic end goal, for certain.

But I also think they exist in a sort of un-designed corporate narcissism, which is a common trait in bubble economies — I am not judging them particularly severely.

Netscape under Clark and Andreessen and Sun under McNealy both fell into corporate narcissism: the belief that only they really mattered, that they were chosen, and that the world needed to rearrange itself to just let them shine. They arguably let themselves get played by Oracle (a corporate psychopath) and others as a result.

OpenAI's position is profoundly corporate-narcissistic: all we need is all the money in the economy and not to have to do anything upsetting like think about turning a profit for the next four years. Like rich kids. It would be nice if you believed we were so important that we should get an enormous stipend for just being us.

Anthropic's position is: we think we're so unique and ominous that government needs to make us both essential and terrifying. We have to exist otherwise worse people will.

Both narcissistic positions.

baq 7 hours ago | parent | next [-]

> Regulatory capture is the OpenAI and Anthropic end goal, for certain.

it has to be, because the other way around - the government taking over parts or the whole thing - is inevitable if the trend holds.

blitzar 7 hours ago | parent | next [-]

the inevitable trend is that numbers will be free and nobody will control the whole thing

ai-celebrities are just clinging to relevance like all the other celebrities out there

intended 6 hours ago | parent [-]

HN is the builder side of the conversation, and in my experience, few safety people congregate here.

The safety side of tech is a PTSD inducing shit show. Governments are more than happy to champion age verification laws, because parents, around the world, are clamoring for anything to pump the breaks on the social media experiment.

Society outside of HN is quite tired of Tech, and I despair of figuring out a way to make this clear to the commentariat.

ang_cire 6 hours ago | parent | next [-]

Social media is old hat now.

As someone on the "safety side of tech", social media is being exploited to increase surveillance and government control precisely because its actual social influence is heavily on the wane, and capital is happy to sacrifice what's left to increase the profits of the expanding public/private tech surveillance industry (with "protect the children" controls on social media like age verification being the usual backdoor route it always is).

Society may be growing tired of Tech, but governments aren't, and in fact they're heavily expanding their back channel reliance on not-traditionally-military Tech as an extension of their Defense spending.

intended 5 hours ago | parent [-]

Cyber security has the maturity that trust and safety hopes to achieve at some point.

Social media was being exploited from inception. Palantir had sales documents for sock puppet management software back in the PHP era.

I don’t disagree that Government is interested in tech, but I will push back on the dismissal of child safety that is inherent in your comment, intended or not.

For all that some people in the firm may have tried to do the right thing, Social media firms have created bad outcomes for children, and executives were briefed on the harms they were going to cause.

This is the dismissal that concerns me, because it ends up miscalculating the level of anger and unhappiness amongst the voting populace, and therefore the political will to pass regulation to reign tech in.

ang_cire 33 minutes ago | parent [-]

The political will is already captured and redirected.

There are numerous bills to limit AI access for consumers, to combat deepfakes hurting children. There are no bills introduced or passed to prevent AI being used to target dronestrikes that kill children abroad, or surveil children domestically.

What the public wants doesn't actually matter right now, only what the government will allow to let pass, which in this case is additional internet surveillance.

Under a future, better government this may change, but (sadly) nothing is going to sink tech's dominance right now.

The anger and unhappiness against tech is good, and hopefully someday they'll burn down all the data centers and I'll never have to hear the words Cloud Computing again, but (to paraphrase a famous Eve Online interaction) it's not going to be today, and it's not going to be us.

dofm 6 hours ago | parent | prev | next [-]

> Society outside of HN is quite tired of Tech, and I despair of figuring out a way to make this clear to the commentariat.

I don't think anyone in tech is really truly engaging with how quickly the shine has come off the tech industry. Except maybe Apple, who even so still have some work to do.

malfist 6 hours ago | parent [-]

Technology and science is the intersection that is supposed to make our lives better, easier, more prosperous. The last decade or two what marvelous technology has came from silicon valley that hasn't served primarily the billionaire class and made life worse for the common people.

The yoke of silicon valley is feeling heavy. People might just throw it off.

thewebguyd 2 hours ago | parent | prev [-]

> Society outside of HN is quite tired of Tech, and I despair of figuring out a way to make this clear to the commentariat.

s/Tech/Tech Companies

Tech did it to themselves. People like and want technology. What they don't like and don't want more of is enshittified, user hostile technology. The answer is out there, but our collective school systems failed to teach computing irt free software/open source and instead schools themselves all bought in on enshittified, proprietary tech, or even just dumped trying to teach computing at all outside of "how to login to google classroom and google docs"

I grew up lucky, in that my dad was a dev, my first PC as a kid ran red hat, my high school had an intro to programming class (in BAISC lol). It shaped how I approached computing growing up, and my values. It makes me look at the things we have now and think "No, you're just repackaging community free software and selling it back to me, I'll pass on that."

That experience isn't available to anyone born after that specific era, instead their tech experience is shaped by walled gardens, vendor lock-in, and straight up hostile and manipulative software, so its no wonder they are tired of it. They don't even know a different world (of software) exists.

dofm 7 hours ago | parent | prev [-]

Porque no los dos?

baq 7 hours ago | parent [-]

this is exactly the play is my point

Aperocky 5 hours ago | parent | prev [-]

Spot on. There's a certain level of drinking the kool-aid or getting high on their own supply. Anthropic is a lot worse than OpenAI but OpenAI had to go through rounds of shedding.

trollbridge 3 hours ago | parent | next [-]

A\ and OpenAI each have their own unique kind of nonsense. I think OpenAI has just been less successful with persuading the rest of the world that they should have all the money in the world.

Anthropic has been surprisingly successful at convincing them that they should control frontier models because they're so dangerous that... only Anthropic can be trusted with them.

(If they're really so dangerous, the right way to deal with them is through a democratic process and taking them out of the hands of a for-profit private entity.)

dofm 4 hours ago | parent | prev [-]

To be maximally fair to them, I think it is difficult to be one of the key businesses in a market bubble and not fall victim to this kind of thinking, especially when the continued inflation of the bubble depends on you — lots of people lose their shirts if you don't push hard to be "special".

But as you say, there is a measure of getting high on one's own supply now.

And there's the curious solipsistic energy of Sam Altman whimsically musing in public that it turns out his product is too expensive for people and they complain when you make the price realistic (when it possibly needs to be more expensive for OpenAI to survive).

They seem to believe that the ordinary rules either will not or somehow must not apply to them; it's increasingly bizarre to watch.

Maybe the people around pets.com were this bizarre; we didn't have so much livestreamed interview content to show us.

7 hours ago | parent | prev | next [-]
[deleted]
throw1234567891 7 hours ago | parent | prev [-]

Yeah yeah, but after the IPO!

barrkel 6 hours ago | parent | prev | next [-]

Do you think token completion endpoints are the final form for AI APIs?

slowmovintarget 5 hours ago | parent | prev | next [-]

That thesis is not about what Anthropic will achieve, but about what power they think they ought to have.

That's a different problem that what you're arguing against.

almostdeadguy 3 hours ago | parent | prev | next [-]

To this point, I've never understood the supposed "alignment" between the EA/AI Safety crowd and Anthropic's mission that the author comments on. Be the stewards of the Machine God, but responsibly? I think the Manhattan project, which AI development is commonly analogized to, had a lot more intrinsic properties to gate against uncontrolled proliferation (which still happened to some extent). Also this is a company that is expected to go public this year, at which point there will be a slew of new voices pushing the company to increase its value, mission be damned.

People like Yud at least have a clear consistency in their advocacy that we shouldn't be developing this at all. Anyone who thinks they can reconcile Anthropic's work with the AI safety mission is in total fantasyland, if it's not just a public persona they've adopted strategically.

swalsh 7 hours ago | parent | prev [-]

The distilled versions miss the spark of the model. Its like they land in the uncanny valley of models.

realusername 6 hours ago | parent [-]

They get to 80% of the top models for 10x cheaper, unless you don't care about the money at all, it's hard to ignore.