Remix.run Logo
xmcqdpt2 3 hours ago

You can understand how transformers work from just reading the Attention is All You Need paper, which is 15 pages of pretty accessible DL. That's not the part that is impressive about LLMs.