Mostly. It gives language models the way to dynamically allocate computation time, but the models are still fundamentally imitative.