DonaldPShimoda 3 hours ago

Right — they're not reasoning, they're generating text that statistically models reasoning. Anyone who says differently is selling something.

jeremyjh an hour ago

That is what a base model does. After RL it is a very different thing, and anyone who says they know what it is, is naive or dishonest. These things are grown, not made, and we really do not understand how they work in many important ways.