▲ | aurareturn 6 days ago | |||||||
Yes, they do. They're called Neural Engine, aka NPUs. They aren't being used for local LLMs on Macs because they are optimized for power efficiency running much smaller AI models. Meanwhile, the GPU is powerful enough for LLMs but has been lacking matrix multiplication acceleration. This changes that. | ||||||||
▲ | astrange 6 days ago | parent | next [-] | |||||||
The neural engine is used for the built-in LLM that does text summaries etc., just not third party LLMs. And there's an official port of Stable Diffusion to it: https://github.com/apple/ml-stable-diffusion | ||||||||
▲ | mrheosuper 6 days ago | parent | prev | next [-] | |||||||
I thought 1 of the reason we do ML on GPU is fast Matrix multiplication ? So the new engine is accelerator for matmul accelerator ? | ||||||||
| ||||||||
▲ | cchance 6 days ago | parent | prev [-] | |||||||
These are different these are built into the GPU Cores |