PlatoIsADisease 4 hours ago

What models are you running locally? Just curious.

I am mostly restricted to 7-9B models. I still like the ancient early Llama models because they're pretty unrestricted without needing an abliterated variant.

mark_l_watson 3 hours ago

I experimented with many models on my 16GB and 32GB Macs. With less memory, qwen3:4b is good; on the 32GB Mac, gpt-oss:20b is good. I also like the smaller Mistral models such as mistral:v0.3, and rnj-1:latest is a pretty good small reasoning model.