Isn’t this just a benchmark?
“Model can count to 5”… tick.
“Model can count to 10”… sorry you gotta wait til 2028.