Remix.run Logo
bkjlblh 2 hours ago

> In light of the ability of recent models to accelerate their own development, we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distributed training infrastructure, or ML accelerator design). Using Claude to develop competing models already violates our Terms of Service, but enforcing this restriction through our safeguards avoids accelerating the actors most willing to violate these terms. Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT). These interventions will not affect the vast majority of coding work. We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations

cedws an hour ago | parent | next [-]

This makes me want to see China and open models succeed more than anything :)

382hi an hour ago | parent [-]

Don't worry, we will succeed :)

UncleOxidant 38 minutes ago | parent [-]

Can we get a Qwen3.7-122B, please? Thank you.

mips_avatar 2 hours ago | parent | prev | next [-]

It's bad that Anthropic can determine what this means. If you're building a modern app you're likely training your own embedding models and now anthropic can just silently sabotage your training pipelines?

abixb 38 minutes ago | parent [-]

>We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations

At the scale of API requests that Anthropic sees, I think the affected organization count might be substantial, and they might not be getting the full model capability that they're paying top $$$ for.

Also, wonder how they arrived at that estimation.

wongarsu 29 minutes ago | parent [-]

One in 1000 organizations and one in 3000 requests is indeed a lot

matheusmoreira an hour ago | parent | prev | next [-]

Looks like Anthropic's definition of safety includes their own safety from competition.

dragonwriter 23 minutes ago | parent | next [-]

AI vendor’s idea of safety has always been safety for the interests of the AI vendor in question. This is not a new development, though this may help more people realize it.

SAI_Peregrinus an hour ago | parent | prev | next [-]

It's always been about the safety of their valuation.

wongarsu 22 minutes ago | parent [-]

Only since Claude 3. So a bit over two years now

axus an hour ago | parent | prev [-]

AI-generated competition for thee, not for me

2001zhaozhao an hour ago | parent | prev | next [-]

How do they detect whether an experiment being done on a smaller model is used to improve a competing frontier model, or just an innocuous hobbyist LLM experiment?

vitally3643 42 minutes ago | parent | next [-]

Given how well the cybersecurity safeguards work, they probably don't.

iririririr 39 minutes ago | parent | prev [-]

infering the surroundings, like everything else. they will probably look at which company is your email, and if you wrote "better than claude" on the readme.md

this is LLM, it's not like a science or something.

Jabrov 2 hours ago | parent | prev | next [-]

A million AI researcher voices at big tech companies suddenly cried out in terror and were suddenly silenced

seemaze 30 minutes ago | parent | prev | next [-]

Ah, so this is why raw Mythos was too "dangerous" to realease..

rfgplk an hour ago | parent | prev | next [-]

Meaningless and easily bypassable. Will actually try coding up a tensor library with it, see if it sabotages anything.

mips_avatar an hour ago | parent | next [-]

They said in their terms and conditions they will silently sabotage you if you do this.

qiine 6 minutes ago | parent | prev [-]

easily ?

hashmap 30 minutes ago | parent | prev | next [-]

3 months before asking for what to eat before a linear algebra exam trips the machine learning topic ban is my guess. I got flagged immediately asking why my JEPA thing breaks weird.

thepasch 33 minutes ago | parent | prev | next [-]

Yeesh. Anthropic's paranoia about China is starting to get pathological.

rspeele an hour ago | parent | prev | next [-]

It's afraid!

an hour ago | parent | prev | next [-]
[deleted]
theLiminator an hour ago | parent | prev [-]

This is pretty bullshit, now you have no idea if your output is getting silently nerfed.