| ▲ | omneity a day ago | |
Mistral Large 3 is reportedly using Deepseek V3.2 architecture with larger experts and fewer of them, and a 2B params vision module. | ||
| ▲ | swores a day ago | parent [-] | |
According to whom? I haven't seen any claims of that being the case (other than you), just that there are similar decisions made by both of them. | ||