new | show | ask | jobs Github

enraged_camel 5 hours ago

>> if a model like Mythos, which at best is an incremental improvement over Opus

What an unbelievable claim. Especially since the vast majority of publicly available benchmarks disagree.

▲

BobbyJo 5 hours ago | parent [-]

The model card for mythos shows it being an incremental improvement in all respects besides security.

▲

hodgehog11 4 hours ago | parent | next [-]

This is utterly daft to say if you actually used the thing for hard problems, something that benchmarks have been known to be unable to capture. It is night and day compared to Opus and every other model out there. It was nice while it lasted.

▲

sixothree 2 hours ago | parent [-]

It's strange how uninformed people are when they are so willing to to make assertions. I used it too and it really felt like a generational shift and not an incremental one.

These threads about Anthropic always seem so astroturfed with some of the loudest and most uninformed people around.

	▲	tobyhinloopen 2 hours ago \| parent [-]
		I agree with this, It feels like a small upgrade like Opus 4.9 or something. It’s still pretty good though

▲

bonsai_spool 5 hours ago | parent | prev [-]

Ah yes, the model card that shows an over 10% improvement in agentic coding among other things!

https://www.anthropic.com/news/claude-fable-5-mythos-5