With batched parallel requests, the per-request cost scales down further. Even a MacBook M3 on battery power can do inference quickly and efficiently. Large-scale training is the power hog.
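As a rough illustration of why batching helps, a single forward pass can serve several prompts at once, so fixed costs (loading weights, launching kernels) are amortized across the batch. The sketch below uses Hugging Face `transformers` with a placeholder model; the model name and prompts are illustrative, not from the original.

```python
# Minimal sketch of batched inference, assuming the `transformers` and `torch`
# packages are installed and a small local causal LM is available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any small local causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token   # gpt2 has no pad token by default
tokenizer.padding_side = "left"             # left-pad for decoder-only generation
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

# Several independent requests handled in one batched call.
prompts = [
    "Explain batching in one sentence:",
    "Why is inference cheaper than training?",
    "Name one benefit of on-device inference:",
]

# One generate() call processes all prompts together, so the per-request
# compute and energy cost drops compared to issuing them one at a time.
inputs = tokenizer(prompts, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=30)

for text in tokenizer.batch_decode(outputs, skip_special_tokens=True):
    print(text)
```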