Remix.run Logo
consumer451 3 hours ago

Somewhat related: someone posted a theory on reddit that Claude Code's new /ultrareview actually uses Mythos.

Does that seem plausible to anyone else? It runs on their cloud. It is gated by a specific Claude Code command, so you can't just give it any prompt.

tekacs 3 hours ago | parent | next [-]

Something in favor of this is the fact that it runs in their cloud and literally tells you that it costs I think $10 to $25 per run

1ucky 3 hours ago | parent | prev | next [-]

Why would they use their most expensive model when sonnet or opus can do the job as well?

0x696C6961 3 hours ago | parent | prev [-]

It would be pretty simple to see what API they're calling.

consumer451 3 hours ago | parent [-]

That's what I meant to get at by "it runs on their cloud."

They can name that user-facing ultrareview API endpoint whatever they want, and we have no way to see what model endpoint it calls internally once running on their cloud, right?

zarzavat 2 hours ago | parent [-]

Introduce intentional and increasingly subtle vulns and test against Sonnet, Opus, etc? Should give statistical evidence of its power.