Remix.run Logo
flashgordon 5 days ago

wow - do you mind sharing any links to a specific setup? Also whats the biggest model anybody has run on this?

paxys 5 days ago | parent | next [-]

You can run a decent model on it, say highly quantized Qwen or Deepseek R1 getting 5-10 tokens/sec output, but it will be nothing in comparison to a commercial offering like Claude, o3 or Gemini. For that you need a datacenter-class GPU going for $50K-100K a pop.

mtkd 4 days ago | parent [-]

But a small collective running that box, especially spanning timezones, could potentially be a viable alternative or will be soon -- with obv privacy gains too

lossolo 5 days ago | parent | prev | next [-]

Unfortunately, you will not be able to run any model on this that is comparable to the Claude models.

icelancer 5 days ago | parent | prev [-]

Every model you run on that setup will be at best half as good as Sonnet 4.