| ▲ | slashdave 4 hours ago | |
I was surprised by the ranking, until I read what the test was. Not horribly relevant for coding. The current ranking of all tests makes more sense (well, except for how well Gemini does) | ||
| ▲ | mpeg 2 hours ago | parent | next [-] | |
If you look at the ranking breakdown though, Kimi K2.6 has only participated in the last 5 challenges (claude dominated before then) and if you only count those it would be in first place | ||
| ▲ | SeriousM an hour ago | parent | prev | next [-] | |
The ranking of gold medals only makes sense if all models would gave participate all tests. DNP = Did not participate In this regard, kimi got more and better medals than Claude. | ||
| ▲ | dvfjsdhgfv an hour ago | parent | prev [-] | |
Well, the link you provided basically confirms Kimi's dominance. | ||