The steelman argument - not to say that I agree with it - is that this capability was present all along, but due to variability in the behavior of the model, adding certain context, such as chain-of-thought, exposes it.