Remix.run Logo
You gave me a u32. I gave you root. (io_uring ZCRX freelist LPE)(ze3tar.github.io)
125 points by MrBruh 5 hours ago | 79 comments
saghm 3 hours ago | parent | next [-]

> No bounds check. free_count is incremented before the write, and the write uses the pre-increment value as the index. When free_count == num_niovs at entry, the write goes to freelist[num_niovs], one slot past the end.

"No way to prevent this", Says Only Language Where This Regularly Happens

musicale 2 hours ago | parent | next [-]

> "No way to prevent this", Says Only Language Where This Regularly Happens

   clang -fbounds-safety ...
also see lib0xc etc.: https://news.ycombinator.com/item?id=47978834
dvt 3 hours ago | parent | prev | next [-]

Obviously the way to prevent this is by bounds checking, which is literally in the `770594e` patch. It's just a bug and they happen routinely in all languages. Since this is doing pointer arithmetic, it could just as easily happen in unsafe Rust, for example.

gpm 3 hours ago | parent | next [-]

Like they said, "no way to prevent this" (kind of bug from happening again).

mikestorrent 3 hours ago | parent [-]

Static analysis and other tools can find this, but they're expensive; wonder what the kernel team has access to?

PlasmaPower 3 hours ago | parent | next [-]

If static analysis could actually find these issues with a reasonable false positive rate, the companies behind them would be running them on Linux to get the publicity of having found the issues like all the AI companies are doing now. Imo the good static analysis heuristics are already built into compilers or in open source linters.

canucker2016 10 minutes ago | parent [-]

The cheap, low-hanging "fruit" lint rules have been added to today's C/C++ compilers. But these rules can be fragile, depending on what level the static analysis scan occurs - source-code-level-textual pattern matching or use of an AST/parse tree.

Possible problems within a function should be discoverable.

This particular bug would be hard to discover for a typical linter unless they knew/remembered that there are two execution paths for cleanup of a given element.

canucker2016 30 minutes ago | parent | prev | next [-]

Coverity scans several open source projects for free. see https://scan.coverity.com/faq and https://scan.coverity.com/projects

see https://scan.coverity.com/projects/linux for the linux-specific scan results - you need to create an account to view the reported defects.

This past couple of weeks isn't a good look for them with the releases of defects found in Linux and Firefox.

TheAdamist 3 hours ago | parent | prev | next [-]

If not static analysis what would ai tools be considered? They're operating off the same source code

Also nice the onion reference by op.

PlasmaPower 2 hours ago | parent | next [-]

"static analysis" is usually deterministic rules you can e.g. put in CI. AI is also somewhat dynamic in that it can execute commands to try stuff out. The best AI vuln finding harnesses work that way, by essentially putting the AI inside of a fuzzer-like environment and telling it to produce a crash.

wizzwizz4 2 hours ago | parent | prev [-]

It's a reference to Xe Iaso's blog (e.g. https://xeiaso.net/shitposts/no-way-to-prevent-this/CVE-2025...), which is itself a reference to The Onion.

saghm 39 minutes ago | parent [-]

It's possible I had seen that blog post and not remembered! I was intending to reference the Onion though (and even googled to make sure I had the wording right), but seeing someone else make the same joke and forgetting is certainly something I would do

ivan_gammel 3 hours ago | parent | prev [-]

Technically, the kernel team is sufficiently competent to design and build bespoke tools for themselves. It‘s probably a question of risk assessment and priorities.

ellieh 2 hours ago | parent | prev | next [-]

sure, but with unsafe Rust you have a very clear marking for the section of code that requires additional care and attention. it is also customary to include a "SAFETY" comment outlining why using unsafe is OK here

dvt 2 hours ago | parent [-]

You actually kind of don't, I use like a zillion crates which have unsafe Rust in them and it's not like I'm sitting here reading every single line of their code. I like Rust for various reasons, but its memory safety is (imo) overstated, especially when doing low-level stuff.

josephg 2 hours ago | parent [-]

Almost all rust (95%) is safe rust. You can opt out of array bounds checks with unsafe { array.get_unchecked(idx) } instead of just typing array[idx]. But I can't remember the last time I saw anyone actually do that in the wild. Its not common practice, even in most low level code.

Rust is bounds checked by default. C is not. Defaults matter because, without a convincing reason, most people program in the default way.

amluto 2 hours ago | parent | prev | next [-]

But one would have to explicitly choose to use unsafe Rust for this instead of ordinary safe Rust. And safe Rust has no particular difficulty writing to slots in an array or slice or vector specified by their index.

skullone 2 hours ago | parent [-]

except nearly everyone uses unsafe rust

saghm an hour ago | parent | next [-]

"unsafe Rust" is not a binary; you don't opt into it for every single line of code. Given that the entire premise behind the idea that using C instead of Rust is fine is that people should be able to pay close attention and not make mistakes like this, having the number of places you need to look be a tiny fraction of the overall code that's explicitly marked as unsafe is a massive difference from C where literally every line of the code could be hiding stuff like this.

josephg 2 hours ago | parent | prev | next [-]

No they really don't. 95% of rust is safe rust[1].

Also unsafe rust doesn't remove bounds checks. arr[idx] is bounds checked in every context.

You can opt out of array bounds checking by writing unsafe { arr.get_unchecked(idx) } . But thats incredibly rare in practice.

[1] https://cs.stanford.edu/~aozdemir/blog/unsafe-rust-syntax/

overfeed 22 minutes ago | parent [-]

> 95% of rust is safe rust.

Based on the raw number of assorted crates, which has no bearing on kernel code. The more relevant question is, can a performant, cross-architecture, kernel ring-buffer be written in safe Rust?

Jtsummers 2 hours ago | parent | prev [-]

> except nearly everyone uses unsafe rust

Really? Why? I've not used Rust outside of some fairly small efforts, but I've never found a reason to reach for unsafe. So why is "nearly everyone" else using it?

dvt 2 hours ago | parent [-]

Let's say you want to call win32 (or Mac) OS functions, all of a sudden you're doing all kinds of wonky pointer stuff because that's how these operating systems have been architected. Doing unsafe stuff is pretty inevitable if you want to do anything non-hello-world-ish.

Jtsummers 2 hours ago | parent | next [-]

> Doing unsafe stuff is pretty inevitable if you want to do anything non-hello-world-ish.

So the vast majority of Rust projects involve writing at least one unsafe block? Is that really your claim?

greiskul 2 hours ago | parent [-]

And even if you do end up writing an unsafe block, that should be a massive flag that the code in said block should deserve extra comments on why it is safe, and extra unit tests on verifying that it does not blow up.

How do you know the unsafe operation is safe? What are the preconditions the code block has? Write it down, review it, test it.

saghm an hour ago | parent [-]

Exactly; I feel like a lot of people seem to misunderstand what Rust is trying to solve. It's fundamentally not trying to make unsafe code impossible; it's making the number of places you need to audit it a tiny fraction of your codebase compared to needing to audit the entirety of a C or C++ codebase. When I'm doing code reviews, you'd better believe I'm going to spend some extra time on any unsafe block I see to figure out if it's necessary and if so, if it's actually safe safe (with the default assumption for both of those being that they're not until I can convince myself otherwise).

skydhash 31 minutes ago | parent [-]

The thing is you can actually write quite good C code (see OpenBSD project). The power of C is that it's pragmatic. It lets you write code with you taking the full responsibility of being a responsible person. To err is human, but we developed a set of practices to handle this (by making sure the gun is unloaded and the safety is on before storing it to avoid putting holes in feet).

I like type checking and other compile time checks, but sometimes they feel very ceremonial. And all of them are inference based, so they still relies on the axiom being right and that the chain of rules is not broken somewhere. And in the end they are annotations, not the runtime algorithm.

dralley an hour ago | parent | prev | next [-]

A tiny fraction of programs need to use win32 or Mac OS functions beyond the standard library or other safe wrappers for said functions.

josephg 2 hours ago | parent | prev | next [-]

Making use of win32 functions doesn't turn off bounds checking in your rust code.

amluto 2 hours ago | parent | prev [-]

So what? Just because you used the keyword `unsafe` to call an unsafe API does not mean that you are going to use unsafe pointer access to write to a vector.

Rygian 3 hours ago | parent | prev [-]

That's not prevention. That's remediation.

themafia 2 hours ago | parent | prev [-]

And you see a lot of other languages being used to create operating systems with complicated multiprocessor and locking semantics?

FriedFishes 4 hours ago | parent | prev | next [-]

I can't quite make out if this is new or not. The attack vector here seems congruent with a similar exploit from a couple months ago [1]

But still might be an open threat. On the email thread Jens seems to think that this is already patched and in stable, he also points out that for this exploit to work (as written in the article) you already need escalated privileges [2] Catchy title though.

[1] https://snailsploit.com/security-research/general/io-uring-z...

[2] https://seclists.org/oss-sec/2026/q2/448

rishabhaiover 5 hours ago | parent | prev | next [-]

What is happening? I see multiple outages and CVEs is being reported on HN's front page. I've never seen these many security/incident related posts on HN's front page.

spindump8930 5 hours ago | parent | next [-]

Some combination of reporting bias given concerns about LLM security capabilities and actual new vulnerabilities found with LLM assistance. Even if exploits and outages are unrelated to LLMs, I'm certainly thinking about whether claude could build these things (or if actors already have).

NitpickLawyer 4 hours ago | parent | prev | next [-]

> What is happening?

Slowly at first, and then suddenly. AI assisted anything follows this trend. As capabilities improve, new avenues become "good enough" to automate. Today is security.

john_strinlai 4 hours ago | parent | prev | next [-]

i believe a good portion of the cves hitting the front page are moreso because they are ai-related (found partially/in whole by ai) and make for quick upvotes.

sva_ 44 minutes ago | parent | prev | next [-]

A mix of AI and hybrid warfare.

majorchord 5 hours ago | parent | prev | next [-]

AI is happening.

cachius 5 hours ago | parent [-]

In each recent case?

gordonhart 4 hours ago | parent [-]

AI assistance was explicitly disclosed on yesterday's. Today's has Claude as one of two contributors on this GitHub Pages site at least so it's also very likely.

Agents are capable of finding this kind of stuff now and people are having a field day using them to find high-profile CVEs for fun or profit.

gilrain 5 hours ago | parent | prev | next [-]

Automated vulnerability discovery via LLM.

ryandrake 2 hours ago | parent | next [-]

Anyone care to share which models and which prompts actually lead to finding these kinds of vulnerabilities? Or the narrowing-down workflow that can get an LLM to discover them? Surely just telling claude "Find all vulnerabilities in this project LOL" isn't enough? I hope?

Arcuru an hour ago | parent [-]

The Anthropic researchers have said their flow is as simple as:

1. Pick a file to seed as a starting place.

2. Ask the LLM (in an agent harness) to find a vulnerability by starting there.

3. If it claims to have found something, ask another one to create an exploit/verify it/prove it or whatever.

4. If both conclude there is a vuln, then with the latest models you almost certainly found something real.

Just run it against every file in a repo, or select a subset, or have an LLM select files with a simple "what X files look likely to have vulns?".

So basically yes, it is that simple. It's just a matter of having the money to pay for the tokens.

ryandrake an hour ago | parent [-]

Thanks for the reply. Pretty remarkable.

pixl97 4 hours ago | parent | prev [-]

Everyone was talking about how Mythos was overblown marketing, and while it may be, they missed the forest for the trees. Capabilities have been escalating for a year now and we're at the point of widespread impact. I don't suspect we'll see a slowdown for a long time.

themafia 2 hours ago | parent | prev [-]

Perhaps it was the prior quiescent period that was the anomaly.

stonegray 4 hours ago | parent | prev | next [-]

> “and is writable with CAP_SYS_ADMIN”

Am I reading this wrong or is this just a way of executing an arbitrary binary with uid=0 if you have both CAP_NET_ADMIN and CAP_SYS_ADMIN?

If you can write modprobe_path, is it really news that you can find a way to execute code?

PlasmaPower 3 hours ago | parent | next [-]

No, you can grant yourself this inside an unprivileged user namespace. `unshare -Ur capsh --print` lists the capabilities inside a user namespace and demonstrates that it has both CAP_SYS_ADMIN and CAP_NET_ADMIN.

Almost all distros allow unprivileged user namespaces, and in my opinion this is the right decision, because they're important for browser sandboxing which I think is more important than LPEs.

delusional 2 hours ago | parent [-]

I don't think namepsace CAP_SYS_ADMIM grants you access to write non namespaces sysctls like modprobe_path

PlasmaPower 25 minutes ago | parent [-]

You're probably right, but that seems like the less important part of this. At that point you've already got an out-of-bounds write. Another comment speculated that you could use PageJack as an alternative exploit path once you have that primitive: https://news.ycombinator.com/item?id=48069623

pizzalife 4 hours ago | parent | prev [-]

Right. `CAP_SYS_ADMIN` is for all intents and purposes equivalent to root.

JoeDohn 19 minutes ago | parent | prev | next [-]

So this is another CVE? Or am I misreading this one? "Copy‑fail", "DirtyFrag", now "IUrinegOnYou :)"?

Joke aside, we'll see more CVEs in the coming months, and in a sense that's good: it leaves less maneuvering room for bad actors (especially those selling them to the highest bidder).

csmantle an hour ago | parent | prev | next [-]

I first read this from the author's posting to oss-security. Turns out that the author did agree to revise the blog post for the "admin cap for root shell" part [^0]. [^1] would probably tell more.

The title looks like clickbait to me.

[^0]: https://www.openwall.com/lists/oss-security/2026/05/08/10

[^1]: https://www.openwall.com/lists/oss-security/2026/05/08/14

pamcake an hour ago | parent | prev | next [-]

This kind of post really shouldn't require client-side js — from third-party domain — to read...

static markdown version: https://raw.githubusercontent.com/ze3tar/ze3tar.github.io/9d...

kro 4 hours ago | parent | prev | next [-]

CAP_NET/SYS_ADMIN is required for this. So this would be "not as bad" as the others.

kam 2 hours ago | parent | next [-]

Also "The page pool is only created on a real ZCRX-capable NIC (mlx5 ConnectX-6+, Intel E800, NFP)"

t0mas88 3 hours ago | parent | prev [-]

It could work for container escape?

dundarious an hour ago | parent | prev | next [-]

How many systems have the relevant NICs, and followed the non-automatic setup steps in https://docs.kernel.org/networking/iou-zcrx.html, and are not running within a VM/container disabling io_uring?

This seems on the low impact end of the numerous historical io_uring issues.

Interesting and important all the same.

shorden 2 hours ago | parent | prev | next [-]

Interesting, I haven't tested this myself but intuitively I think that a 4 byte OOB write is plenty for a data-only attack like [PageJack](https://i.blackhat.com/BH-US-24/Presentations/US24-Qian-Page...), so I don't think hardening against the KASLR leaks discussed in OP would necessarily save you from this attack.

staticassertion 5 hours ago | parent | prev | next [-]

io-uring is a security nightmare. Constant privescs and a powerful primitive for syscall smuggling. Worth considering disabling it outright (already the case for most containers afaik).

otterley 4 hours ago | parent [-]

At one point, Google disabled io_uring on its production servers (https://security.googleblog.com/2023/06/learnings-from-kctf-...) - I don't know whether this is still true, though. Perhaps a Google can confirm.

vsgherzi 4 hours ago | parent [-]

super curious on this one as well, last I heard they've been enabling it slowly

ctoth 2 hours ago | parent | prev | next [-]

If this many are public right now, what does that say about the dark matter of private ones? What's the typical public-private rate for this sort of thing/can someone help me calibrate my base rate expectations?

SubiculumCode 3 hours ago | parent | prev | next [-]

Do most servers need this? Or can most of us 'sysctl -w kernel.io_uring_disabled=2 ' ?

baq 5 hours ago | parent | prev | next [-]

What’s our prior for p(doom) today…?

rvz 5 hours ago | parent | prev [-]

Another one.

Linux is falling apart faster than it can assign these CVEs.

hn92726819 4 hours ago | parent | next [-]

Falling apart? You mean getting stronger? Every single one of these is an existing hole being patched. It isn't making new holes

Gigachad an hour ago | parent [-]

Government agencies probably already have half of these exploits in their private toolbox for years now. Finding and patching them is good, but there probably needs to be some systematic change to prevent them rather than just patching bugs when they get found.

gordonhart 4 hours ago | parent | prev | next [-]

Linux is "falling apart" because it's the highest-profile open source project people can point LLM agents at to find CVEs. It'll come out the other end of this hardened by all of the attention it's getting, but the next few months/years will be... bumpy.

maven29 5 hours ago | parent | prev | next [-]

perhaps this will lead to better AppArmor and SELinux defaults?

ChocolateGod 4 hours ago | parent [-]

People will just turn SELinux off rather than have to go through the horrible tooling when it breaks a regular use case.

yjftsjthsd-h 4 hours ago | parent [-]

I do think SELinux is a good example of how robust software with poor UX/DX gets undermined by that poor UX/DX. Although I do wonder if AI can help with it?

EGreg 5 hours ago | parent | prev [-]

How's BSD doing? How about Amazon Linux?

yjftsjthsd-h 4 hours ago | parent | next [-]

Amazon Linux is a Linux distro? Though, yes, I would like to know how the BSDs are doing.

otterley 4 hours ago | parent [-]

Yes, it's a fork of Fedora. https://docs.aws.amazon.com/linux/al2023/ug/what-is-amazon-l...

toast0 4 hours ago | parent | prev | next [-]

FreeBSD is getting piles of security updates lately too. Not sure about the other BSDs.

cachius 5 hours ago | parent | prev [-]

And Windows?

mschuster91 4 hours ago | parent [-]

Pray to God no one ever lets an AI agent run loose on the various leaked Windows source code dumps.

Given Windows' absurd amount of backwards compatibility, chances are pretty high that there are a lot of sleeping dragons buried inside even modern Windows 10/11 kernel and userland that date back to code and issues from the 90s - code where half the people who have worked on it probably not just have departed Microsoft but departed living in the meantime.