rcxdude 2 hours ago

Unified Memory is mainly how consumer hardware has enough RAM accessible by the GPU to run larger models, because otherwise the market segmentation jacks up the price substantially.

bigyabai an hour ago | parent [-]

UMA removes the PCIe bottleneck and replaces it with a memory controller + bandwidth bottleneck. For most high-performance GPUs, that would be a direct downgrade.

zozbot234 26 minutes ago | parent [-]

> For most high-performance GPUs, that would be a direct downgrade.

You really can't say that; it depends on what you're running. If your model fits within a dGPU's VRAM then yes, obviously, but plenty of models are larger.
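
A rough back-of-envelope sketch of that trade-off, assuming token generation is purely bandwidth-bound (every weight is read once per token) and that whatever doesn't fit in VRAM gets streamed over PCIe on every token. The function names and all the bandwidth and capacity figures are made up for illustration, not measurements of any real hardware, and the UMA case assumes the whole model fits in unified memory.

```python
def seconds_per_token_dgpu(model_gb, vram_gb, vram_bw_gbps, pcie_bw_gbps):
    """Estimated time per token on a discrete GPU.

    Weights resident in VRAM are read at VRAM bandwidth; any part of the
    model that does not fit is assumed to cross PCIe once per token.
    """
    resident = min(model_gb, vram_gb)
    spilled = model_gb - resident
    return resident / vram_bw_gbps + spilled / pcie_bw_gbps


def seconds_per_token_uma(model_gb, uma_bw_gbps):
    """Estimated time per token with unified memory (model assumed to fit)."""
    return model_gb / uma_bw_gbps


# Hypothetical hardware figures, chosen only to show the crossover.
VRAM_GB, VRAM_BW, PCIE_BW, UMA_BW = 24, 1000, 32, 250

for model_gb in (16, 64):
    dgpu = seconds_per_token_dgpu(model_gb, VRAM_GB, VRAM_BW, PCIE_BW)
    uma = seconds_per_token_uma(model_gb, UMA_BW)
    winner = "dGPU" if dgpu < uma else "UMA"
    print(f"{model_gb} GB model: dGPU {dgpu:.3f} s/token, UMA {uma:.3f} s/token -> {winner}")
```

With these made-up numbers the dGPU wins while the model fits in VRAM, and UMA wins once a large share of the weights has to cross PCIe. Real results depend on the actual bandwidths, quantization, and how the runtime splits layers between devices.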