| ▲ | deminature 2 hours ago | ||||||||||||||||
One of the major attack vectors is distillation, where millions of questions are auto-generated and coordinated to produce training data for new LLMs. Anthropic alleges Minimax, Deepseek and Kimi were trained this way. Deepseek 4 compares favorably to Opus, so they're probably trying to prevent Deepseek 5 from being a bootleg Mythos. https://www.anthropic.com/news/detecting-and-preventing-dist... | |||||||||||||||||
| ▲ | pseudosavant an hour ago | parent | next [-] | ||||||||||||||||
It takes a lot of audacity to train on all the data you can without any license, attribution, etc and then act like you can own the outputs of the model so that someone else doesn't make a model from your data without a license. I've lost a lot of respect for Anthropic in the last 24 hours. | |||||||||||||||||
| |||||||||||||||||
| ▲ | anon373839 an hour ago | parent | prev [-] | ||||||||||||||||
Distillation is not an "attack", despite Anthropic themselves coining the self-serving phrase "distillation attack". And as others have noted, it is precisely identical to the sort of "attack" on published works which Anthropic themselves used to train their models. | |||||||||||||||||
| |||||||||||||||||