Remix.run Logo
Animats 11 hours ago

That's neat.

A hard problem in optimization today is trying to fit code into the things complex SSE-type instructions can do. Someone recently posted an example where they'd coded a loop to count the number of one bits in a word, and the compiler generated a "popcount" instruction. That's impressive.

mattgodbolt 9 hours ago | parent [-]

It may be a different post, but I covered this earlier this month in the same series of blog posts/YouTube videos.