Remix.run Logo
Analemma_ 9 hours ago

“Opus 4.6 found 22 security bugs, Mythos found 271 on an initial evaluation” sure seems to refute the grumbling I’ve seen from a couple OAI people on Twitter that Mythos isn’t actually anything special and everything it finds could be found by earlier models too.

jruohonen an hour ago | parent [-]

They also put this in the end in boldfaced:

"Encouragingly, we also haven’t seen any bugs that couldn’t have been found by an elite human researcher."

But, in overall, I think it was a well-written positive take (instead of the fear-mongering party line).