Remix clone Hacker News

new | show | ask | jobs Github

	▲	kristopolous 16 hours ago
		Fully aware of the DGX spark I've actually been looking into AMD Ryzen AI Max+ 395/392 machines. There's some interesting things here like https://www.bee-link.com/products/beelink-gtr9-pro-amd-ryzen... and https://www.amazon.com/GMKtec-5-1GHz-LPDDR5X-8000MHz-Display... ... haven't pulled the trigger yet but apparently inferencing on these chips are not trash. Machines with the 4xx chips are coming next month so maybe wait a week or two. It's soldered LPDDR5X with amd strix halo ... sglang and llama.cpp can do that pretty well these days. And it's, you know, half the price and you're not locked into the Nvidia ecosystem
	▲	ejpir 15 hours ago \| parent \| next [-]
		unfortunately the bigger models are pretty slow in token speed. The memory is just not that fast. You can check what each model does on AMD Strix halo here: https://kyuz0.github.io/amd-strix-halo-toolboxes/
	▲	Tepix 6 hours ago \| parent \| prev [-]
		4xx chips are less capable than the 395