| ▲ | measurablefunc 7 hours ago | |||||||
What about the other direction? What languages are expressible w/ RNNs & LTLs that require exponential blowup for transformers? | ||||||||
| ▲ | aesthesia 6 hours ago | parent | next [-] | |||||||
Not quite an answer to your question, but you might find this interesting. The Olmo Hybrid paper has some results on relative complexity of problems that can be solved by transformers and RNNs. They don't look at size, just solvability, and find that the sets of problems solvable by the two architectures are incomparable. They actually use these results to inform their architecture design, which includes both attention and state space layers. (Specifically, they choose gated delta-nets with negative eigenvalues, which they show have greater expressivity than those without.) | ||||||||
| ▲ | yorwba 7 hours ago | parent | prev [-] | |||||||
Proposition 16. UHATs have polynomially bounded expansion over LTL. In particular, given an LTL formula φ, one can construct in polynomial time a UHAT T such that L(T) = L(φ). i.e. the blowup is only exponential in one direction. | ||||||||
| ||||||||