Remix clone Hacker News

new | show | ask | jobs Github

	▲	fleventynine an hour ago
		If local models are good enough, doesn't that increase demand for DRAM as everyone buys DRAM for their poorly utilized local machines? Surely it is a more efficient use of DRAM to run inference on shared hardware with large batch sizes and more utilization.
	▲	szatkus 22 minutes ago \| parent [-]
		Luckily very few people can configure and are interested in local models. But your nearby datacenter running Chinese open-weight models is also good enough.