Remix clone Hacker News

new | show | ask | jobs Github

	▲	mortenjorck 3 hours ago
		I assume those don't just work automatically with an off-the-shelf gguf. What do you need in your local inference stack to take advantage of M5's neural accelerators?
	▲	aurareturn 3 hours ago \| parent [-]
		They do work with llama.cpp and MLX automatically.