| ▲ | KV Sharing, MHC, and Compressed Attention(magazine.sebastianraschka.com) | |
| 20 points by gmays 4 hours ago | 1 comments | ||
| ▲ | nibab an hour ago | parent [-] | |
cool stuff. my comp sci major feels almost completely redundant in this new vibecoding era and i feel like the only way to stay relevant as a programmer is to learn all these compute primitives and become an LLM systems guy. | ||