▲ | flashgordon 5 days ago | |||||||
wow - do you mind sharing any links to a specific setup? Also whats the biggest model anybody has run on this? | ||||||||
▲ | paxys 5 days ago | parent | next [-] | |||||||
You can run a decent model on it, say highly quantized Qwen or Deepseek R1 getting 5-10 tokens/sec output, but it will be nothing in comparison to a commercial offering like Claude, o3 or Gemini. For that you need a datacenter-class GPU going for $50K-100K a pop. | ||||||||
| ||||||||
▲ | lossolo 5 days ago | parent | prev | next [-] | |||||||
Unfortunately, you will not be able to run any model on this that is comparable to the Claude models. | ||||||||
▲ | icelancer 5 days ago | parent | prev [-] | |||||||
Every model you run on that setup will be at best half as good as Sonnet 4. |