qarl 2 hours ago
The model we're discussing (DeepSeek) is open weight.
kube-system an hour ago
Perhaps your prior comment would've been better received if it had said that specifically instead of "Chinese models".

But also, the latest DeepSeek is 1.6T parameters. "Choosing" to run this locally is a choice that comes with a seven-digit price tag, and it is a sunk cost in hardware that will probably not run any other frontier model anytime soon. Most organizations are not looking to spend millions of dollars on a workaround specifically to run DeepSeek.

Most enterprise consumption in this space is still very experimental, and a pay-as-you-go model is much more palatable. Most are simply looking for three checkboxes: is it close to frontier performance, is it compliant with my organization's requirements, and is it a good price? DeepSeek can only do two of the three at the same time.
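
For a rough sense of why 1.6T parameters implies a lot of hardware, here is a back-of-the-envelope sketch in Python. It only counts memory for the weights themselves. The 80 GB per-accelerator figure and the precisions listed are assumptions for illustration, not details of any specific deployment, and the function names are made up for this example. KV cache, activations, and redundancy are ignored, so the counts are lower bounds.

```python
import math


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def min_accelerators(memory_gb: float, accelerator_gb: float) -> int:
    """Smallest number of accelerators whose combined memory holds the weights."""
    return math.ceil(memory_gb / accelerator_gb)


NUM_PARAMS = 1.6e12     # parameter count quoted in the comment above
ACCELERATOR_GB = 80.0   # assumption: memory per accelerator, for illustration only

# Assumed storage precisions; real deployments may mix these.
for label, bytes_per_param in [("16-bit", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
    gb = weight_memory_gb(NUM_PARAMS, bytes_per_param)
    count = min_accelerators(gb, ACCELERATOR_GB)
    print(f"{label}: {gb:,.0f} GB of weights, at least {count} accelerators")
```

Under those assumptions the weights alone need 3,200 GB at 16-bit, 1,600 GB at 8-bit, and 800 GB at 4-bit, which is 40, 20, and 10 accelerators respectively before any serving overhead.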