| ▲ | agambrahma a day ago | |
Agree, maybe "changes the physics" was too strong, shared cpu/gpu memory is not new. What is different then is the combination of 1. UMA memory (and yes, iGPU had this, pre-M1) 2. enough bandwidth / GPU throughput for local inference 3. straightforward `makeBuffer(bytesNoCopy:)` path So, the novelty isn't the shared memory itself, but the whole chain lining up to make the Wasm linear memory -> Metal-buffer approach practical + performant enough. (and not saying there's some Apple Silicon magic here either ... it'd work anywhere there was UMA and no-copy host-pointer path) | ||