spwa4 6 hours ago

TLDR: Mistral Medium 3.5, text-only, 128B dense model, 256k context window, modified MIT license. Model is ~140G ... https://huggingface.co/mistralai/Mistral-Medium-3.5-128B

They more or less claim this exceeds Claude Sonnet 3.5 on most things, but is worse than Sonnet 3.6, and exceeds all other open models.

Oh and they have a cloud service that will code your apps "in the cloud". But, yeah, at this point, so does my cat.

And, yes, unsloth is on it: https://huggingface.co/unsloth/Mistral-Medium-3.5-128B-GGUF (but 4bit quant is 75G)

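The two sizes quoted above can be sanity-checked with a quick bits-per-weight calculation. This is only a rough sketch: it assumes "G" means 10^9 bytes and that the parameter count is exactly 128e9, neither of which is confirmed by the comment, and the helper name `bits_per_weight` is made up for illustration.

```python
def bits_per_weight(size_bytes: float, n_params: float) -> float:
    """Average number of bits stored per parameter for a given file size."""
    return size_bytes * 8 / n_params


# Assumptions (not from the model card): "G" = 1e9 bytes, 128e9 parameters.
N_PARAMS = 128e9
FULL_SIZE_BYTES = 140e9   # the "~140G" figure
QUANT_SIZE_BYTES = 75e9   # the "75G" 4-bit GGUF figure

if __name__ == "__main__":
    print(f"full:  {bits_per_weight(FULL_SIZE_BYTES, N_PARAMS):.2f} bits/weight")
    print(f"quant: {bits_per_weight(QUANT_SIZE_BYTES, N_PARAMS):.2f} bits/weight")
```

Under those assumptions the ~140G download works out to about 8.75 bits per weight and the 75G quant to about 4.7, so the "4bit" file averages somewhat more than a flat 4 bits per parameter.
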
wolttam 6 hours ago

Sonnet 4.5 and 4.6*

There is no way it exceeds “all other” open models - but it does exceed all of Mistral’s past models. You can see it getting blown past by GLM 5.1 and Kimi in this.

Still excited to give it a try

pama 6 hours ago

Unfortunately they only compare to old “all other open models”. There are probably over 10 other open models better than it by now.

Marciplan 6 hours ago

You mean Sonnet 4.5 and 4.6, right?