Wouldn't it be really weird if a open-weight model dropped in performance? Because then, it would rather be the Elo ranking