martinald a day ago

Yes, but you could use that die space for GPU cores.

heavyset_go 14 hours ago

At least with the embedded platforms I'm familiar with, silicon dedicated to an NPU is both faster and more power-efficient than offloading to GPU cores.

If you're going to be doing ML at the edge, NPUs still seem like the most efficient use of die space to me.