Remix.run Logo
ChadNauseam 3 days ago

> Anthropic claims Mythos is in a class of its own, the evidence corroborates this and the government believes it.

They didn't release Mythos, they released Fable, which was Mythos + a classifier that detected potentally-dangerous prompts and blocked them. Everyone who used it noticed how aggressive the classifier was. It would trigger constantly over totally innocent stuff.

catigula 3 days ago | parent [-]

A classifier that was exposed as non-efficacious for a product touted as having extremely dangerous capabilities.

I can generate hacks trivially by asking any model to fix open source code.

Let’s not pretend you get to have your cake and eat it too.