| ▲ | AlotOfReading 7 hours ago | |
No, this is in the same ballpark as ideas like big-O notation. The paper is saying that transformers can recognize a language with exponentially fewer symbols than other kinds of systems, i.e. they're more succinct. It's exactly as related to real models as computer science is to real computers. | ||