| ▲ | segmondy 2 days ago |
| They have had the best math models for about a year most folks just didn't know about it. You can't find inference on APIs, but I run these at home, this is also the advantage of open models. https://huggingface.co/deepseek-ai/DeepSeek-Math-V2
https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B |
|
| ▲ | simonjgreen a day ago | parent | next [-] |
| You are of course specifically referring to the math optimised models, not the chat ones folks would generally encounter. Not that I’m trying to contradict you, your point is super valid and I agree with you! But I’m supplementing to help anyone following along who may make choices. This is when it happened for anyone interested: https://binaryverseai.com/deepseek-math-v2-benchmarks-review... |
| |
| ▲ | jug 20 hours ago | parent [-] | | Shouldn't one use e.g a Wolfram Alpha MCP endpoint for math in AI? From what I've seen on even premium non-quantized models, I would never ever trust the innate ability of a LLM to calculate. |
|
|
| ▲ | lowbloodsugar 2 days ago | parent | prev | next [-] |
| You run a 671B model at home? |
| |
| ▲ | segmondy 2 days ago | parent | next [-] | | Yes, and plenty of others do too. Quantizied. Join us at r/localllama My largest models 318G /llmzoo/models/Qwen3.5-397B
377G DeepSeekv3.2-nolight
380G /llmzoo/models/DeepSeek-V3.2-UD
400G /llmzoo/models/Qwen3.5-397B-Q8
443G DeepSeek-Math-v2
443G DeepSeek-V3-0324-Q5
522G /llmzoo/models/GLM5.1
545G /llmzoo/models/kimi2.6
546G /llmzoo/models/KimiK2.5
| | | |
| ▲ | tclancy 2 days ago | parent | prev | next [-] | | It's a big house. | |
| ▲ | UncleOxidant a day ago | parent | prev | next [-] | | Maybe if there was a 1-bit quant. | | |
| ▲ | barbacoa 16 hours ago | parent [-] | | Apple briefly was selling Mac studio with 512 GB of unified ram, meaning all that was available as vram. |
| |
| ▲ | 2 days ago | parent | prev [-] | | [deleted] |
|
|
| ▲ | verdverm 2 days ago | parent | prev [-] |
| Vertex AI has had deep seek available via API for a while |
| |
| ▲ | segmondy 2 days ago | parent [-] | | I'm talking about their specialized math models, not the general model. |
|