Remix clone Hacker News

new | show | ask | jobs Github

	▲	kennywinker 3 hours ago
		Their example big earner models are FLUX.2 Klein 4B and FLUX.2 Klein 9B, which i imagine could generate a lot more tokens/s than a 26B model on your machine. For Gemma 4 26B their math is: single_tok/s = (307 GB/s / 4 GB) * 0.60 = 46.0 tok/s batched_tok/s = 46.0 * 10 * 0.9 = 414.4 tok/s tok/hr = 414.4 * 3600 = 1,492,020 revenue/hr = (1,492,020 / 1M) * $0.200000 = $0.2984 I have no idea if that is a good estimate of how much an M5 Pro can generate - but that’s what it says on their site. They do a bit of a sneaky thing with power calculation: they subtract 12Ws of idle power, because they are assuming your machine is idling 24/7, so the only cost is the extra 18W they estimate you’ll use doing inference. Idk about you, but i do turn my machine off when i am not using it.