Remix.run Logo
spwa4 6 hours ago

TLDR: Mistral Medium 3.5, text-only, 128B dense model, 256k context window, modified MIT license. Model is ~140G ...

https://huggingface.co/mistralai/Mistral-Medium-3.5-128B

They more or less claim this exceeds Claude Sonnet 3.5 on most things, but is worse than Sonnet 3.6, and exceeds all other open models.

Oh and they have a cloud service that will code your apps "in the cloud". But, yeah, at this point, so does my cat.

And, yes, unsloth is on it: https://huggingface.co/unsloth/Mistral-Medium-3.5-128B-GGUF (but 4bit quant is 75G)

wolttam 6 hours ago | parent | next [-]

Sonnet 4.5 and 4.6*

There is no way it exceeds “all other” open models - but it does exceed all of Mistral’s past models.

You can see it getting blown past by GLM 5.1 and Kimi in this.

Still excited to give it a try

2ndorderthought 5 hours ago | parent [-]

It looks like qwen 3.6 is winning and smaller for the April small model roll out

pama 6 hours ago | parent | prev | next [-]

Unfortunately they only compare to old “all other open models”. There are probably over 10 other open models better than it by now.

Marciplan 6 hours ago | parent | prev [-]

You mean Sonnet 4.5 and 4.6 riight

spwa4 5 hours ago | parent [-]

right