Remix.run Logo
JellyYelly 3 days ago

They say its mythos like, without actually comparing it to Mythos (fair enough, it's not public) but the bar for a model to be mythos-like has to be that you can produce as many novel and high severity security vulns outlined in the Mythos redteam blog. I haven't seen any other lab produce a report like that yet. The proof is in the pudding.

halJordan a day ago | parent | next [-]

It benches very similarly on the cyber benches Anthropic put out. That meets the bar.

cassianoleal 3 days ago | parent | prev [-]

> The proof is in the pudding.

Funny you say that, when the Mythos team have produced no proof either.

subscribed 3 days ago | parent | next [-]

Not sure if the reports like this count? https://www.theregister.com/2026/04/22/mozilla_firefox_mytho...

I don't have strong opinion on that.

maplethorpe 3 days ago | parent | prev [-]

I believe they've stated that it would be too dangerous to release.

satvikpendem 3 days ago | parent | next [-]

Just like OpenAI said GPT 2 was too dangerous to release?

There was just an article on this phenomenon today: https://news.ycombinator.com/item?id=47890235

maplethorpe 3 days ago | parent [-]

They released a system card talking about how powerful it was. I don't think OpenAI did that with GPT 2.

satvikpendem 2 days ago | parent [-]

I mean, that's just part of the marketing too. OpenAI would've absolutely added a system card, they just weren't invented back in the GPT 2 era.

Razele 2 days ago | parent | prev [-]

too uneconomical to run*