Remix.run Logo
ilmj8426 2 hours ago

It's impressive to see how fast open-weights models are catching up in specialized domains like math and reasoning. I'm curious if anyone has tested this model for complex logic tasks in coding? Sometimes strong math performance correlates well with debugging or algorithm generation.

alansaber 4 minutes ago | parent | next [-]

It makes complete sense to me: highly-specific models don't have much commercial value, and at-scale llm training favours generalism.

stingraycharles 7 minutes ago | parent | prev [-]

kimi-k2 is pretty decent at coding but it’s nowhere near the SOTA models of Anthropic/OpenAI/Google.