Remix.run Logo
2ndorderthought 3 hours ago

"my model is the most dangerous"

"No mine is the most dangerous"

"Nuh uh mine is"

"Mine could kill everyone!"

"Mine could do it faster!"

"Prove it!!!"

This is where we are

davidgrenier 3 hours ago | parent | next [-]

Yeah I guess two companies who would otherwise be considered going for bankruptcy have models too expensive to run. As they don't see themselves making money any time soon, they have to turn every future model into a weird fascination.

DivingForGold an hour ago | parent | next [-]

China’s DeepSeek prices new V4 AI model at 97% below OpenAI’s GPT-5.5

Did somebody say that Elon is stealthly funding: Seven lawsuits filed against OpenAI by families of Canada mass-shooting victims

As always, when the going get's tough, the tough ultimately resort to lawsuits.

cyanydeez 2 hours ago | parent | prev | next [-]

think about it in the form of who can pay. theyre at b2b. and swiftly moving to government.

2ndorderthought 2 hours ago | parent [-]

All that user data is a huge asset for government contracts.

redsocksfan45 3 hours ago | parent | prev [-]

[dead]

cedws 32 minutes ago | parent | prev | next [-]

Can't wait for the Chinese models to completely wipe the floor with them in 6 months.

peddling-brink 17 minutes ago | parent | next [-]

Ominous phrasing.

dk970 24 minutes ago | parent | prev [-]

[dead]

noosphr an hour ago | parent | prev | next [-]

Remember that they have been saying that since gpt2.

I didn't think crying could be such a successful business model.

lesuorac an hour ago | parent [-]

It's just "thinking past the sale" which they've been doing forever.

i.e. "I'm so worried that our capped for-profit structure will limit your returns when we make over 1 Trillion in profit".

boringg an hour ago | parent | prev | next [-]

Marketing stunts. The equivalent of holding a line outside a popular bar.

basisword an hour ago | parent [-]

Given the USG has asked Anthropic not to release Mythos I'd wager it's more than a marketing stunt.

boringg an hour ago | parent [-]

It can be both and I don't know how much I would trust the USG as the canary in the coal mine given their technical readiness typically seems low across most institutions in that they are probably more exposed because they haven't shored up their systems.

brikym 3 hours ago | parent | prev | next [-]

It's like that phone call in The Big Short where Goldman suddenly change their mind once they hold a position.

concinds 3 hours ago | parent | prev | next [-]

These models demonstrably have good vulnerability research capabilities.

I'm sure their marketing department is ecstatic but you guys are far more hype-based than what you're calling out.

authnopuz 2 hours ago | parent | next [-]

Good but not necessarily better that was is already pay-as-you-go available today. ref. https://www.flyingpenguin.com/the-boy-that-cried-mythos-veri...

This AISLE benchmark is interesting in this matter: https://aisle.com/blog/ai-cybersecurity-after-mythos-the-jag...

And the recently discovered Copy Fail by Xint code is another proof that the gating is overblown: https://xint.io/blog/copy-fail-linux-distributions

ZyanWu 3 hours ago | parent | prev [-]

> demonstrably

I'm not entirely up to date on each week's LLM hype train/scandal but last I heard there was no public access to it or public-trusted 3rd parties that can review model's capabilities

2ndorderthought 2 hours ago | parent | next [-]

You are up to date. Mythos had unauthorized access because of poor security but that's it as far as I know. Not exactly a good sign for something being advertised as a weapon...

SpicyLemonZest 2 hours ago | parent | prev [-]

It’s easy to end up with no public-trusted third parties if we arbitrarily distrust third parties who say the capabilities match what’s promised. Mozilla for example says it found hundreds of Firefox vulnerabilities, and I think it’s pretty unlikely they’re lying to cover Anthropic’s back.

calgoo an hour ago | parent [-]

I think the question around the Firefox find, is not that they found hundreds of vulnerabilities - they found hundreds of bugs.

What would be really interesting is a side by side Claude Opus 4.7 and Mythos comparison.

vasco 3 hours ago | parent | prev [-]

Would AGI start by hacking competing labs to hamper their progress?

cdrnsf 26 minutes ago | parent | next [-]

No, because AGI is a fantasy.

Avicebron 3 hours ago | parent | prev [-]

You'll have to define what you mean by AGI

fodkodrasz 3 hours ago | parent [-]

AGI: Automatically Generating Income

gordonhart 2 hours ago | parent [-]

This is a surprisingly concrete and defensible definition of AGI.

Avicebron an hour ago | parent [-]

Is it defensible? It sounds like a thin disguise over "income for me but not for thee"?

redsocksfan45 28 minutes ago | parent [-]

[dead]