Remix.run Logo
louiereederson 6 hours ago

I don't think you can say this with confidence, outside-in. It's not just about safety. The additional unknown is cost - I don't just mean API cost, but fully loaded cost for a given task. Is the model cost effective for tasks such that it has product market fit?

We don't yet know if Mythos was a level shift in the capability/cost frontier, or a continued extension of the same logarithmic capability/cost curve.

solenoid0937 6 hours ago | parent [-]

Some people have access to the model for red team purposes as part of Glasswing and they came away quite spooked according to what I heard

louiereederson 6 hours ago | parent [-]

I don't doubt it, I just mean the decision to release/not release generally may also be informed by the commercial/economic viability of the model for general usage patterns versus extremely high value patterns like vulnerability assessment