Remix clone Hacker News

new | show | ask | jobs Github

	▲	pmarreck 5 hours ago
		pretty much any of them, dude, as long as you have enough RAM, since it uses unified RAM and a powerful SoC CPU/GPU. Literally any M-class model, but the M5 is currently top tier.
	▲	dannyw 3 hours ago \| parent \| next [-]
		The DGX Spark has basically the same memory bandwidth as a M5 Pro, and far more than a M5. Only the M3 Ultra really beats it, and once you start scoping out the cost of a M3 Ultra with 128GB or 256GB, the DGX Spark doesn’t look bad after all.
	▲	mapontosevenths 4 hours ago \| parent \| prev \| next [-]
		Yep. Memory bandwidth is what decides how fast LLM's generate tokens (mostly). The DGX Spark has something like 270 GB/s of memory bandwidth, and the m5 ultra is ~615 GB/s. Theoretically DOUBLE the speed. In practice he only generates like 25% more tok/s, but that's still very impressive. The spark can fine tune models in 1/4 the time and excels at other compute tasks in ways that Mac never can. Plus the high bandwidth ConnectX-7 ports would be like $1700 to buy on a card just for the network adapters... But for generating tokens, it just plain loses.
	▲	fsuts 2 hours ago \| parent \| prev [-]
		How noisy does his fan get…