Remix.run Logo
slashdave 4 hours ago

I was surprised by the ranking, until I read what the test was. Not horribly relevant for coding.

The current ranking of all tests makes more sense (well, except for how well Gemini does)

https://aicc.rayonnant.ai

mpeg 2 hours ago | parent | next [-]

If you look at the ranking breakdown though, Kimi K2.6 has only participated in the last 5 challenges (claude dominated before then) and if you only count those it would be in first place

SeriousM an hour ago | parent | prev | next [-]

The ranking of gold medals only makes sense if all models would gave participate all tests.

DNP = Did not participate

In this regard, kimi got more and better medals than Claude.

dvfjsdhgfv an hour ago | parent | prev [-]

Well, the link you provided basically confirms Kimi's dominance.