Remix.run Logo
sciencejerk 2 hours ago

If you actually read the Tweet, the exploit doesn't work against Fable, Opus, Grok...at least, in the examples.

Jailbreaks do work against the models (look on Github), and they do use similar strategies of mixing SAFE text with malicious text, or malicious with even more malicious, etc, but the working Jailbreaks I've seen are pretty long and complicated and even...creepy.

csomar an hour ago | parent [-]

Did you actually read what the tweet/blog post are about?