| ▲ | _heimdall 12 hours ago | |
I want to ask what makes them magic, but even those building LLMs don't really know what happens when they run inference... I have to assume current architectures aren't optimal though, the idea that we stumbled into the one and only optimal solution seems almost impossible. | ||