| ▲ | Lucasoato an hour ago | |
Do you know what kind of machine do I need to run the original DeepSeek v4 pro model with a good tok/s throughput? | ||
| ▲ | karmakaze 3 minutes ago | parent | next [-] | |
DeepSeek v4 pro is still rather large, DeepSeek-V4-Flash[0] becomes relatively more reasonable with smaller quantizations and eventually will be able to effectively offload 'facts' to system RAM. See DwarfStar 4[1] for current sweet spots. | ||
| ▲ | zamalek an hour ago | parent | prev [-] | |
It's not really plausible to host at home, unless you have deep pockets. What you/we win here is a model that doesn't suddenly become worse like the proprietary ones have been doing, and you can choose a provider from a competitive market. | ||