Remix clone Hacker News

new | show | ask | jobs Github

	▲	k__ 2 hours ago
		I tried some smaller Gemma4 and Qwen3.6 quants on my MBA with M5/16GB and had like 20-60 tokens per second. At 60 it felt pretty okay and that hardware is on the lower end. I'd assume a Mac with 32-64GB memory would get some reasonable results.