| ▲ | A Visual Guide to Attention Variants in Modern LLMs(magazine.sebastianraschka.com) | |
| 13 points by Anon84 12 hours ago | 1 comments | ||
| ▲ | nv2156 6 hours ago | parent [-] | |
Great read about the technical evidence around the shift from better attention to better serving of models. Just came across a companion piece around this https://news.ycombinator.com/item?id=47388676 | ||