wasfgwp 6 hours ago

> high as the second best general purpose model

According to benchmarks, which are gamed to the extreme these days. Trusting them blindly isn’t exactly rational either. They don’t necessarily translate that well to real-world tasks.

It’s obviously not “distilling” as such, but there are reasons why Chinese models are consistently several months behind OpenAI/Anthropic.

2ndorderthought 6 hours ago

[dead]