Remix.run Logo
Tanjreeve 4 days ago

This reads like you're ridiculing people for being proved right?

ripped_britches 3 days ago | parent [-]

No the point of the comment is that there is no meaningful difference between model performance improvements from before and after this news of a benchmark weakness (spoiler alert, almost all of the benchmarks contain serious problems). The models are improving every quarter whether HN likes it or not.