| ▲ | bkjlblh 2 hours ago | ||||||||||||||||||||||||||||
> In light of the ability of recent models to accelerate their own development, we’ve implemented new interventions that limit Claude’s effectiveness for requests targeting frontier LLM development (for example, on building pretraining pipelines, distributed training infrastructure, or ML accelerator design). Using Claude to develop competing models already violates our Terms of Service, but enforcing this restriction through our safeguards avoids accelerating the actors most willing to violate these terms. Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT). These interventions will not affect the vast majority of coding work. We estimate they will impact ~0.03% of traffic, concentrated in fewer than 0.1% of organizations | |||||||||||||||||||||||||||||
| ▲ | cedws an hour ago | parent | next [-] | ||||||||||||||||||||||||||||
This makes me want to see China and open models succeed more than anything :) | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | mips_avatar 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
It's bad that Anthropic can determine what this means. If you're building a modern app you're likely training your own embedding models and now anthropic can just silently sabotage your training pipelines? | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | matheusmoreira an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Looks like Anthropic's definition of safety includes their own safety from competition. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | 2001zhaozhao an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
How do they detect whether an experiment being done on a smaller model is used to improve a competing frontier model, or just an innocuous hobbyist LLM experiment? | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | Jabrov 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
A million AI researcher voices at big tech companies suddenly cried out in terror and were suddenly silenced | |||||||||||||||||||||||||||||
| ▲ | seemaze 30 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Ah, so this is why raw Mythos was too "dangerous" to realease.. | |||||||||||||||||||||||||||||
| ▲ | rfgplk an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Meaningless and easily bypassable. Will actually try coding up a tensor library with it, see if it sabotages anything. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||
| ▲ | hashmap 30 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
3 months before asking for what to eat before a linear algebra exam trips the machine learning topic ban is my guess. I got flagged immediately asking why my JEPA thing breaks weird. | |||||||||||||||||||||||||||||
| ▲ | thepasch 33 minutes ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
Yeesh. Anthropic's paranoia about China is starting to get pathological. | |||||||||||||||||||||||||||||
| ▲ | rspeele an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
It's afraid! | |||||||||||||||||||||||||||||
| ▲ | an hour ago | parent | prev | next [-] | ||||||||||||||||||||||||||||
| [deleted] | |||||||||||||||||||||||||||||
| ▲ | theLiminator an hour ago | parent | prev [-] | ||||||||||||||||||||||||||||
This is pretty bullshit, now you have no idea if your output is getting silently nerfed. | |||||||||||||||||||||||||||||