taps the sign

  Unified Memory Is A Marketing Gimmeck. Industrial-Scale Inference Servers Do Not Use It.

▲ zozbot234 7 hours ago | parent [-]

Industrial Scale Inference is moving towards LPDDR memory (alongside HBM), which is essentially what "Unified Memory" is.

	▲	0x457 4 hours ago \| parent \| next [-]
		> which is essentially what "Unified Memory" is. Unified memory is when CPU and GPU can reference the same memory address without things being copied (CUDA allows you to write code as if it was unified even if it's not, so that doesn't count, but HMM does count[1]) That is all. What technology is underneath is hardware detail. Unified memory on macs lets you put something into a memory, then do some computation on it with CPU, ANE, ANA, Metal Shaders. All without copying anything. DGX Spark also has unified memory. [1]: https://docs.nvidia.com/cuda/cuda-programming-guide/02-basic...
	▲	bigyabai 7 hours ago \| parent \| prev [-]
		LPDDR is LPDDR. There's nothing "unified" about it architecturally.