Remix.run Logo
AlotOfReading 7 hours ago

No, this is in the same ballpark as ideas like big-O notation. The paper is saying that transformers can recognize a language with exponentially fewer symbols than other kinds of systems, i.e. they're more succinct.

It's exactly as related to real models as computer science is to real computers.