Remix clone Hacker News

	▲	lzaborowski 5 hours ago
		One thing I’ve noticed with local models is that people tolerate a lot more trial-and-error behavior. When a hosted model wastes tokens, it feels expensive, but when a local model loops a bit, it just feels like it’s “thinking.” If models like Qwen can get good enough for coding tasks locally, the real shift might be economic rather than purely about capability.
	▲	trvz 2 hours ago
		Wasted tokens are preferred for local models. I need the GPU mainframe in my bedroom to heat it, as I live in a third-world country with unreliable heating (Switzerland).