Remix.run Logo
justinclift 2 hours ago

> It hallucinates a lot more then Sonnet or even MiniMax M2.5.

Ugh, that's not good.

I evaluated Kimi K2 a while back for some text understanding -> summarisation tasks, and of the 100 tasks it hallucinated about 30% of the output. :( :( :(