vidarh 2 hours ago

Kimi 2.6 is nowhere near even Sonnet in overall robustness. It can get close when everything goes perfectly.

I have about 1KLOC of harness code, written by Kimi, that exists only to work around quirks in Kimi, such as infinite tool-call loops and other weirdness. No other model I've tested needs any of it.
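
To give a rough idea of what one such workaround can look like, here is a minimal sketch of a tool-call loop guard. This is not the actual harness code; `ToolCallLoopGuard`, `should_abort`, and the `window` and `max_repeats` parameters are hypothetical names made up for illustration, and it assumes tool arguments are JSON-serializable.

```python
import json
from collections import deque


class ToolCallLoopGuard:
    """Hypothetical guard: flags a model that keeps issuing the same tool call."""

    def __init__(self, window=6, max_repeats=3):
        # Only the last `window` calls are remembered, so old calls age out.
        self.recent = deque(maxlen=window)
        self.max_repeats = max_repeats

    def should_abort(self, tool_name, arguments):
        # Canonicalize arguments so key order does not hide a repeated call.
        key = (tool_name, json.dumps(arguments, sort_keys=True))
        self.recent.append(key)
        # Abort once the identical call appears max_repeats times in the window.
        return self.recent.count(key) >= self.max_repeats


guard = ToolCallLoopGuard()
for _ in range(3):
    stuck = guard.should_abort("read_file", {"path": "a.txt"})
if stuck:
    print("identical tool call repeated; stop and re-prompt the model")
```

What to do once the guard fires (re-prompt, inject an error message, abort the request) is a separate design choice that this sketch leaves open.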

You can do quite a bit with it and never run into those quirks, or you might hit them on every request.

It is very sensitive to "confusing" things about its environment in a way Sonnet and Opus are not.

Still great value, but they have some way to go.