JellyYelly 3 days ago
They say it's Mythos-like without actually comparing it to Mythos (fair enough, it's not public), but the bar for a model to be Mythos-like has to be that it can produce as many novel, high-severity security vulns as those outlined in the Mythos red-team blog. I haven't seen any other lab produce a report like that yet. The proof is in the pudding.
halJordan a day ago
It benches very similarly on the cyber benches Anthropic put out. That meets the bar.
cassianoleal 3 days ago
> The proof is in the pudding.
Funny you say that, when the Mythos team have produced no proof either.