segmondy 2 days ago

They have had the best math models for about a year; most folks just didn't know about it. You can't find inference for them on APIs, but I run these at home, which is also an advantage of open models.

https://huggingface.co/deepseek-ai/DeepSeek-Math-V2 https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B

simonjgreen a day ago | parent | next [-]

You are of course specifically referring to the math-optimised models, not the chat ones folks would generally encounter. Not that I’m trying to contradict you; your point is super valid and I agree with you! I’m just adding context for anyone following along who may be choosing a model.

This is when it happened for anyone interested: https://binaryverseai.com/deepseek-math-v2-benchmarks-review...

jug 20 hours ago | parent [-]

Shouldn't one use e.g. a Wolfram Alpha MCP endpoint for math in AI? From what I've seen on even premium non-quantized models, I would never ever trust the innate ability of an LLM to calculate.

lowbloodsugar 2 days ago | parent | prev | next [-]

You run a 671B model at home?

segmondy 2 days ago | parent | next [-]

Yes, and plenty of others do too. Quantized. Join us at r/localllama

My largest models

   318G    /llmzoo/models/Qwen3.5-397B
   377G    DeepSeekv3.2-nolight
   380G    /llmzoo/models/DeepSeek-V3.2-UD
   400G    /llmzoo/models/Qwen3.5-397B-Q8
   443G    DeepSeek-Math-v2
   443G    DeepSeek-V3-0324-Q5
   522G    /llmzoo/models/GLM5.1
   545G    /llmzoo/models/kimi2.6
   546G    /llmzoo/models/KimiK2.5

danilocesar 2 days ago | parent | next [-]

Is your house's heating system based on H100s?

2 hours ago | parent [-]
[deleted]
Liftyee 2 days ago | parent | prev | next [-]

What hardware do you use?

MezzoDelCammin a day ago | parent | next [-]

I think the answer to this is: "yes"

CoolThings a day ago | parent | prev | next [-]

A Beowulf cluster of 256 x Raspberry Pi 3.

tclancy a day ago | parent | prev [-]

All of it.

chid a day ago | parent | prev [-]

Even quantised, those are HUGE.

tclancy 2 days ago | parent | prev | next [-]

It's a big house.

UncleOxidant a day ago | parent | prev | next [-]

Maybe if there was a 1-bit quant.
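
For a rough sense of scale, here is a back-of-envelope sketch of how weight storage shrinks with bits per weight. It assumes exactly 671e9 parameters (taken from the "671B" in the model name) and a uniform bit width; the function name is made up for illustration. Real quantized files also store scales and keep some tensors at higher precision, so actual sizes on disk will be larger than these figures.

```python
def quantized_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes).

    Ignores quantization metadata (scales, zero points) and any tensors
    kept at higher precision, so this is a lower bound, not a file size.
    """
    total_bytes = n_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 1e9


# Assumption: 671e9 parameters, read off the "671B" in the model name.
N_PARAMS = 671e9

if __name__ == "__main__":
    for bits in (1, 4, 5, 8, 16):
        size = quantized_size_gb(N_PARAMS, bits)
        print(f"{bits:>2}-bit: ~{size:,.0f} GB")
```

Under these assumptions even a hypothetical 1-bit quant would need roughly 84 GB for the weights alone, and an 8-bit one roughly 671 GB.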

barbacoa 16 hours ago | parent [-]

Apple was briefly selling a Mac Studio with 512 GB of unified RAM, meaning all of that was available as VRAM.

2 days ago | parent | prev [-]
[deleted]
verdverm 2 days ago | parent | prev [-]

Vertex AI has had DeepSeek available via API for a while.

segmondy 2 days ago | parent [-]

I'm talking about their specialized math models, not the general model.