Remix.run Logo
Lucasoato an hour ago

Do you know what kind of machine do I need to run the original DeepSeek v4 pro model with a good tok/s throughput?

karmakaze 3 minutes ago | parent | next [-]

DeepSeek v4 pro is still rather large, DeepSeek-V4-Flash[0] becomes relatively more reasonable with smaller quantizations and eventually will be able to effectively offload 'facts' to system RAM. See DwarfStar 4[1] for current sweet spots.

[0] https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash

[1] https://news.ycombinator.com/item?id=48050842

zamalek an hour ago | parent | prev [-]

It's not really plausible to host at home, unless you have deep pockets. What you/we win here is a model that doesn't suddenly become worse like the proprietary ones have been doing, and you can choose a provider from a competitive market.