| ▲ | 2 days ago | |||||||
| [deleted] | ||||||||
| ▲ | embedding-shape 2 days ago | parent [-] | |||||||
> Curiously Opus 4.7 claims to have an 87.6% pass rate and Mythos claims to have a 93.9% pass rate... leading to the conclusion that it's actually possible to "solve" the problems that OpenAI claims are incorrect. Huh, that is very curious indeed. If it's true that Anthropic claims those pass rates while OpenAI claims the test cases are flawed and broken, then clearly one of them isn't telling the whole story... | ||||||||
| ||||||||