Remix.run Logo
computerex 8 hours ago

They are all autoregressive. They have just been trained to emit thinking tokens like any other tokens.