| ▲ | swores a day ago | |||||||
Agree with you over OP - as well as Qwen there are others like Mistral and Meta's Llama, and from China the likes of Baidu ERNIE, ByteDance Doubao, and Zhipu GLM. Probably others too. Even if all of these were considered worse than the "only 5" on OP's list (which I don't believe to be the case), the scene is still far too young and volatile to look at a ranking at any one point in time and say that if X is better than Y today then it definitely will be in 3 months' time, let alone in a year or two. | ||||||||
| ▲ | omneity a day ago | parent [-] | |||||||
Mistral Large 3 reportedly uses the DeepSeek V3.2 architecture with fewer, larger experts and a 2B-parameter vision module. | ||||||||