Remix clone Hacker News

new | show | ask | jobs Github

	▲	pitched 4 hours ago
		For a business with ten or more engineers/people-using-ai, it might still make sense to set this up. For an individual though, I can’t imagine you’d make it through to positive ROI before the hardware ages out.
	▲	zozbot234 4 hours ago \| parent \| next [-]
		It's hard to tell for sure because the local inference engines/frameworks we have today are not really that capable. We have barely started exploring the implications of SSD offload, saving KV-caches to storage for reuse, setting up distributed inference in multi-GPU setups or over the network, making use of specialty hardware such as NPUs etc. All of these can reuse fairly ordinary, run-of-the-mill hardware.
	▲	DeathArrow 3 hours ago \| parent \| prev [-]
		Since you need at least a few of H100 class hardware, I guess you need at least few tens of coders to justify the costs.