mentalgear 7 hours ago
While I'm certainly sceptical of purely LLM-(re)written software, I would assume that, for the cyberattack vector, Anthropic used their new Mythos model to adequately test against it. Maybe someone has more info on whether they have mentioned that.
bastawhiz 35 minutes ago
> to adequately test against

How does one determine what "adequate" looks like for a million lines of code? You can't fit a million lines of code in a 1M-token context window unless every line of code is one token. So you're just sort of praying that you spend enough time and money burning tokens to shake out all the stuff that's bad or wrong.
InsideOutSanta 5 hours ago
I wouldn't be surprised if the kinds of security issues LLMs tend to create are the exact types of security issues LLMs are bad at detecting.
skeeter2020 5 hours ago
So they are defending the LLM-generated code using another one of their LLMs, against attacks from yet other LLMs? So regardless of the outcome and the impact on us, they win?
impulser_ 6 hours ago
Jarred said this had nothing to do with Mythos or Anthropic.