Remix.run Logo
lyfeninja 2 days ago

Below is the "Attention is all you need" paper. Transformers and their attention mechanism was the major breakthrough for modern LLMs. ML has been around for a long time, I'd suggest joining kaggle or something and learn by doing. You'll retain more and realize how broad the category is anymore.

https://arxiv.org/abs/1706.03762