s3p 5 hours ago

>But new A.I. models like Anthropic’s Mythos, which was announced last month, appear to be so good at finding such holes that Anthropic shared it only with a limited number of firms and government agencies in the United States and Britain.

Immediate distrust of the article. GPT 5.5 is out with nearly the same capability. The author might be parroting company marketing, unable to discern that a lot of this is much less complex than it seems. For all we know this group could have had a model examine some obscure line of code thousands of times until it found something.

cobolcomesback 4 hours ago | parent | next [-]

GPT 5.5 does not have the same capabilities as Mythos. There is a separate 5.5-Cyber model which is the Mythos “equivalent”, but access to it is restricted, just as with Mythos. Per OpenAI, the major difference is the built-in safeguards that 5.5 (and other models) have; 5.5-Cyber does not have these safeguards and is more “permissive” for security work.

See https://openai.com/index/gpt-5-5-with-trusted-access-for-cyb...

ofjcihen 4 hours ago | parent | next [-]

I have access to the Cyber version. It’s great at cybersecurity work but only marginally better than its predecessor with the right jailbreaking.

I imagine Mythos is going to be the same story from what I’ve seen so far.

esseph 2 hours ago | parent [-]

https://www.theregister.com/security/2026/05/11/anthropics-b...

ofjcihen an hour ago | parent [-]

Well hey, there you have it

nullstyle 3 hours ago | parent | prev [-]

That reminds me:

The other day the Codex desktop app tried to cajole me into uploading my ID and requesting 5.5-Cyber access while I was having it develop a fuzzing suite for an open source library I'm (we're?) developing. I was able to berate it into getting back to work.

This struck me as a point of emergent enshittification; an anus if you will.

vgalin 2 hours ago | parent [-]

The company doing the actual ID verification (KYC) is probably the last company I'd trust with this kind of data.

To circumvent conversations being flagged as "cybersecurity bad!!!" I often have to use previous models (5.3 for example, and sometimes using them through subagents is enough). And when this method no longer works, local models will be good enough for it to not be a problem (for my use case, at least).

bluGill 4 hours ago | parent | prev | next [-]

That is very clearly the claim made for Mythos, though. The experience of projects that do have access to Mythos suggests that if you use the other models, you're not going to find much of anything. Which is to say: generally we believe it is marketing, as you say; however, the claim the reporter relayed is very clearly stated, even if it's not right.

xorgun 3 hours ago | parent | prev | next [-]

[dead]

reaperducer 5 hours ago | parent | prev [-]

> Immediate distrust of the article… The author might be parroting company marketing, unable to discern that a lot of this is much less complex than it seems.

https://www.nytimes.com/by/dustin-volz

> I am based in The Times’s Washington bureau, and much of my focus is on the dealings of U.S. cybersecurity and intelligence agencies, including the National Security Agency, Central Intelligence Agency, Cybersecurity and Infrastructure Security Agency and the Federal Bureau of Investigation, as well as their counterparts abroad, chiefly in China, Russia, Iran and North Korea.

> My remit spans nation-state hacking conflict, digital espionage, online influence operations, election meddling, government surveillance, malicious use of A.I. tools and other related topics.

> Before joining The Times, I worked at The Wall Street Journal, where I spent eight years covering cyber conflict and intelligence. My recent work at The Journal included a series of articles revealing a major Chinese intrusion of America’s telecommunications networks that breached the F.B.I.’s wiretap systems and has been described as one of the worst U.S. counterintelligence failures in history. I have also worked at Reuters and National Journal, where I began my career in Washington chronicling congressional efforts to reform surveillance practices at the N.S.A. in the wake of the 2013 Edward Snowden disclosures.

> My work has been internationally recognized, including by the White House Correspondents’ Association, the Gerald Loeb Awards, the Society of Publishers in Asia and the Society for Advancing Business Editing and Writing.

What have you done lately?

kubik369 4 hours ago | parent | next [-]

Your comment was surely well meant, but you could have plainly stated that the article author is a seasoned reporter instead of the snarky reply.

GP might be incorrect in stating that the author is parroting Anthropic's marketing, but the author certainly does not go out of his way to specify that these are only Anthropic's claims. It is actually a bit ironic as the article linked[0] from the quoted part (by another author) uses the correct phrasing when dealing with such claims:

> Anthropic, the artificial intelligence company that recently fought the Pentagon over the use of its technology, has built a new A.I. model that it claims is too powerful to be released to the public.

[0] https://archive.ph/GC6WP#selection-4713.0-4713.200

LPisGood 4 hours ago | parent | prev | next [-]

> What have you done lately?

I feel like this website is a particularly dangerous place to ask that and expect it to be a “mic drop” moment. There are a lot of highly accomplished engineers, scientists, founders, CEOs, etc. here who could easily respond to that with any manner of impressive qualifications.

esafak 3 hours ago | parent [-]

https://news.ycombinator.com/item?id=35079

LudwigNagasena 4 hours ago | parent | prev | next [-]

Reporting on such stuff requires networking skills, not technical knowledge.

reaperducer 4 hours ago | parent [-]

> Reporting on such stuff requires networking skills, not technical knowledge.

Guess how I know you've never been a reporter.

crazygringo 3 hours ago | parent | prev | next [-]

Your comment would be fine without the snarky final sentence.

ofjcihen 4 hours ago | parent | prev | next [-]

Okay, well I’ve done more than that and I say he’s right. Now what?

himata4113 4 hours ago | parent | prev | next [-]

nytimes reporters have recently been very disappointing. They're starting to feel like people who managed to become relevant a long time ago but haven't kept up with recent changes, and who are just parroting things others have said instead of offering original thoughts.

anjel 3 hours ago | parent | next [-]

I found their recent investigative article, "How do stars pee at the Met Gala?", to be hard-hitting, yet fair to all sides. [1]

[1] https://archive.is/x9MSO

(You thought I was exaggerating about it being "investigative," dincha.)

Conscat 2 hours ago | parent | prev [-]

Any media company which deliberately rids itself of everyone willing to speak vaguely positively of transsexual people may not be attracting the most free thinking writers.

flextheruler 4 hours ago | parent | prev | next [-]

https://www.logicallyfallacious.com/logicalfallacies/Appeal-...

reaperducer 4 hours ago | parent [-]

Not at all.

OP posited that the author didn't know what he's talking about. I pointed out that the author has far more knowledge and experience in the field than rando internet griefers on HN who immediately reach for "shoot the messenger" when they read something that doesn't neatly fit into their pre-conceived worldview, instead of perhaps learning things from other people.

But at least your trope acknowledges that he's an authority on the subject.

nitwit005 3 hours ago | parent | next [-]

> I pointed out that the author has far more knowledge and experience in the field than rando internet griefers on HN

You mean, you guessed that a random person online lacked experience. The experts are genuinely here too.

ssl-3 4 hours ago | parent | prev [-]

> OP posited that the author didn't know what he's talking about.

That position does not appear to be present.

JumpCrisscross 4 hours ago | parent [-]

Eh, "unable to discern" seems like a polite way of saying someone is talking out of their ass.

megous 4 hours ago | parent | prev [-]

How many zero-day vulns has the article author discovered using AI-assisted methods?