fractorial 12 hours ago

Not "local" in the literal sense, but I set it up to serve at half quant for $23/hr and full quant for $35/hr.

You don't need to have it always on? This is a far cry from "$200/month," but I do not think it's $50k for "useful." Do you see it differently?
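To put rough numbers on the "always on" question, here is a minimal sketch using only the two hourly rates quoted above. The 30-day month and the helper name `rental_cost` are assumptions for illustration, not anything from the actual setup.

```python
# Hourly rates quoted in the comment above (USD per hour).
HALF_QUANT_RATE = 23.0
FULL_QUANT_RATE = 35.0

# Assumption: a 30-day month, running 24 hours a day.
ALWAYS_ON_HOURS = 24 * 30


def rental_cost(hours_used: float, usd_per_hour: float) -> float:
    """Cost of renting the server for a given number of hours."""
    return hours_used * usd_per_hour


for label, rate in [("half quant", HALF_QUANT_RATE), ("full quant", FULL_QUANT_RATE)]:
    always_on = rental_cost(ALWAYS_ON_HOURS, rate)
    # Hours of use per month that a $200 budget would cover at this rate.
    hours_for_200 = 200 / rate
    print(f"{label}: ${always_on:,.0f}/month always on, "
          f"{hours_for_200:.1f} h/month for $200")
```

Under these assumptions, the gap between on-demand and always-on use is what decides whether the hourly pricing is reasonable.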

dakolli 11 hours ago | parent [-]

This is probably the dumbest possible way to do it. Just buy tokens through OpenRouter and you could run it all month, 24/7, at 100 tps for practically nothing. There are tons of ways to pay for things without giving up your personal information.

greenavocado 6 hours ago | parent [-]

  100 tokens/s * 1 month * ($0.14 / 1M tokens) ≈ $37
About $37 for the input tokens on Deepseek V4 Flash if you miss the cache every time.

A decent deal, but Flash is quite dumb and you still have to pay for output tokens.
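For anyone who wants to check the estimate above, a minimal sketch of the same arithmetic. It assumes an average month of 30.44 days and covers input tokens only, with no cache hits. The function name `monthly_token_cost` is made up for illustration.

```python
# Assumption: an average month of 30.44 days.
SECONDS_PER_MONTH = 30.44 * 24 * 3600


def monthly_token_cost(tokens_per_second: float,
                       usd_per_million_tokens: float,
                       seconds: float = SECONDS_PER_MONTH) -> float:
    """Cost of a constant token stream over the given duration."""
    total_tokens = tokens_per_second * seconds
    return total_tokens / 1_000_000 * usd_per_million_tokens


# 100 tokens/s at $0.14 per million input tokens, as quoted above.
input_cost = monthly_token_cost(100, 0.14)
print(f"${input_cost:.2f}")  # prints $36.82
```

Output tokens would be a second call to the same function with the output price, which is not given in this thread.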