yorwba 5 days ago
The argument isn't that anything is possible for LLMs, but that representing LLMs as Markov chains doesn't demonstrate a limitation: the resulting Markov chain would be huge, much larger than the LLM itself, and anything that is possible is possible with a large enough Markov chain. If you limit yourself to Markov chains whose full transition matrix can be stored in a reasonable amount of space (the kind people usually have in mind when they think of Markov chains as very limited), LLMs cannot be represented as such a chain. If you want to show limitations of LLMs by reducing them to another system of computation, you need to pick one that is more limited than LLMs appear to be, not less.
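To put a rough number on "huge", here is a back-of-the-envelope sketch. It assumes each Markov state is one full, fixed-length context window, so the state count is the vocabulary size raised to the context length. The function names and the figures used (a 50,000-token vocabulary, a 1,000-token window) are illustrative assumptions, not the parameters of any particular model.

```python
import math


def markov_state_count_log10(vocab_size: int, context_len: int) -> float:
    """log10 of the number of states, assuming one state per full context window.

    The count is vocab_size ** context_len, so we work in log space to
    avoid building an astronomically large integer.
    """
    return context_len * math.log10(vocab_size)


def sparse_transition_count_log10(vocab_size: int, context_len: int) -> float:
    """log10 of the number of possibly nonzero transitions.

    Each state can move to at most vocab_size successors (append one token,
    drop the oldest), giving vocab_size ** (context_len + 1) entries.
    """
    return (context_len + 1) * math.log10(vocab_size)


if __name__ == "__main__":
    # Illustrative (assumed) sizes, not taken from a real model.
    vocab, window = 50_000, 1_000
    print(f"states: about 10^{markov_state_count_log10(vocab, window):.0f}")
    print(f"transitions: about 10^{sparse_transition_count_log10(vocab, window):.0f}")
```

Under these assumptions the chain has on the order of 10^4699 states, so even a sparse transition table could not be written down, while the LLM that induces it fits on a disk.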
ariadness 5 days ago | parent
> anything that is possible is possible with a large enough Markov chain

This is not true. Do you mean anything that is possible to compute? If yes, then you missed the point entirely.