Remix.run Logo
throw10920 5 hours ago

I've used Opus extensively and tried K2.6 on a few projects, and the gap is huge. K2.6 is nowhere near the performance of Opus. That's fine because it's also far cheaper, but public benchmarks line up with my own personal experience that they aren't comparable in terms of intelligence.

(that is, different places on the Pareto efficiency graph)

abalashov 2 hours ago | parent [-]

No two uses are alike, I suppose. For me, whatever difference is a wash. However, I probably tend to shy away from throwing high-complexity/long-horizon tasks at the model.