| ▲ | Chance-Device 5 hours ago | |||||||||||||||||||
Let’s see, I think these pretty much map out a little chronology of the research: https://arxiv.org/abs/2112.00114 https://arxiv.org/abs/2406.06467 https://arxiv.org/abs/2404.15758 https://arxiv.org/abs/2512.12777 First that scratchpads matter, then why they matter, then that they don’t even need to be meaningful tokens, then a conceptual framework for the whole thing. | ||||||||||||||||||||
| ▲ | bsza 4 hours ago | parent [-] | |||||||||||||||||||
I dont’t see the relevance, the discussion is over whether boilerplate text that occurs intermittently in the output purely for the sake of linguistic correctness/sounding professional is of any benefit. Chain of thought doesn’t look like that to begin with, it’s a contiguous block of text. | ||||||||||||||||||||
| ||||||||||||||||||||