I wonder if this puts into question the mythos benchmark which smashed basically all coding benchmarks to a staggering degree.