twoodfin 6 days ago

If Intel had a server SKU with fully integrated, performance-competitive GPU cores that work with CUDA + unified memory, they’d sell billions’ worth in a day to the CSPs alone.

Sounds like they will someday soon.

There will always be giant, faraway GPU supercomputer clusters to train models. But the future of inference (where the model fits) is local to the CPU.