| ▲ | Aurornis 3 hours ago | ||||||||||||||||
> Apple’s unified memory architecture plays a huge part in this. This will trigger a large scale rearchitecture of mobile hardware across the board. I am sure they are already underway. Putting the GPU and CPU together and having them both access the same physical memory is standard for phone design. Mobile phones don't have separate GPUs and separate VRAM like some desktops. This isn't a new thing and it's not unique to Apple > I understand this is for a demo but do we really need a 400B model in the mobile? A 10B model would do fine right? What do we miss with a pared down one? There is already a smaller model in this series that fits nicely into the iPhone (with some quantization): Qwen3.5 9B. The smaller the model, the less accurate and capable it is. That's the tradeoff. | |||||||||||||||||
| ▲ | alwillis 3 hours ago | parent [-] | ||||||||||||||||
> Putting the GPU and CPU together and having them both access the same physical memory is standard for phone design. > Mobile phones don't have separate GPUs and separate VRAM like some desktops. That's true. The difference is the iPhone has wider memory buses and uses faster LPDDR5 memory. Apple places the RAM dies directly on the same package as the SoC (PoP — Package on Package), minimizing latency. Some Android phones have started to do this, too. iOS is tuned to this architecture which wouldn't be the case across many different Android hardware configurations. | |||||||||||||||||
| |||||||||||||||||