Youden 8 hours ago

They mentioned LMArena, you can get the results for that here: https://lmarena.ai/leaderboard/text

Mistral Large 3 is ranked 28th, behind all the other major SOTA models. The gap between Mistral and the leader is only 73 points, though (1418 vs. 1491). I *think* that means the difference in quality is relatively small.

jampekka 7 hours ago

1491 vs. 1418 Elo means the stronger model is expected to win about 60% of the time.
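
For anyone who wants to check the 60% figure, here is a minimal sketch. It assumes the textbook Elo expected-score formula (logistic curve, base 10, scale 400) and ignores ties; the leaderboard's actual fitting procedure may differ. The function name `elo_expected_score` is just an illustrative choice.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo formula.

    Assumption: base-10 logistic curve with a 400-point scale, no ties.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


leader, mistral = 1491, 1418  # the two scores quoted in the thread

p_leader = elo_expected_score(leader, mistral)
p_mistral = elo_expected_score(mistral, leader)

# The two expected scores are complementary and sum to 1.
print(f"leader wins:  {p_leader:.1%}")
print(f"mistral wins: {p_mistral:.1%}")
```

With a 73-point gap this comes out at roughly 60% for the higher-rated model and roughly 40% for the lower-rated one.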

supermatt 7 hours ago

Probably naive questions:

Does that also mean that Gemini 3 (the top-ranked model) loses to Mistral Large 3 about 40% of the time?

Does that make Gemini 1.5x better, or Mistral two-thirds as good as Gemini, or can we not quantify the difference like that?

esafak 7 hours ago

Yes, of course.
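
One way to read the "1.5x" framing, as a rough sketch: under the standard Elo model (assumed here, with the usual 400-point scale and no ties), a rating gap maps directly to the odds of winning a head-to-head comparison. That is a statement about preference odds, not about one model being "1.5x better" in any absolute sense. The helper name `elo_odds` is hypothetical.

```python
def elo_odds(rating_a: float, rating_b: float) -> float:
    """Odds (wins : losses) of A beating B under the standard Elo model.

    Assumption: base-10 logistic curve with a 400-point scale, no ties.
    """
    return 10.0 ** ((rating_a - rating_b) / 400.0)


odds = elo_odds(1491, 1418)       # ratings quoted in the thread
win_prob = odds / (1.0 + odds)    # convert odds back to a probability

print(f"odds for the leader: {odds:.2f} to 1")
print(f"win probability:     {win_prob:.1%}")
print(f"inverse odds:        {1.0 / odds:.2f}")
```

The odds land close to 1.5 to 1, and the inverse is close to two-thirds, so both phrasings in the question describe the same head-to-head odds.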