▲ | kiitos 4 days ago | |||||||||||||||||||||||||
> Anthropic's more technical users inherently understand how LLMs work. good (if superficial) post in general, but on this point specifically, emphatically: no, they do not -- no shade, nobody does, at least not in any meaningful sense | ||||||||||||||||||||||||||
▲ | omnicognate 4 days ago | parent | next [-] | |||||||||||||||||||||||||
Understanding how they work in the sense that permits people to invent and implement them, that provides the exact steps to compute every weight and output, is not "meaningful"? There is a lot left to learn about the behaviour of LLMs, higher-level conceptual models to be formed to help us predict specific outcomes and design improved systems, but this meme that "nobody knows how LLMs work" is out of control. | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | lukev 4 days ago | parent | prev | next [-] | |||||||||||||||||||||||||
If we are going to create a binary of "understand LLMs" vs "do not understand LLMs", then one way to do it is as you describe; fully comprehending the latent space of the model so you know "why" it's giving a specific output. This is likely (certainly?) impossible. So not a useful definition. Meanwhile, I have observed a very clear binary among people I know who use LLMs; those who treat it like a magic AI oracle, vs those who understand the autoregressive model, the need for context engineering, the fact that outputs are somewhat random (hallucinations exist), setting the temperature correctly... | ||||||||||||||||||||||||||
| ||||||||||||||||||||||||||
▲ | shloked 4 days ago | parent | prev | next [-] | |||||||||||||||||||||||||
I should've been clearer, but what I meant was language models 101. Normal people don't understand even basics like LLMs are stateless by default and need to be given external information to "remember" things about you. Or, what is a system prompt. | ||||||||||||||||||||||||||
▲ | kingkawn 4 days ago | parent | prev [-] | |||||||||||||||||||||||||
Thanks for this generalization, but of course there is a broad range of understanding how to improve usefulness and model tweaks across the meat populace. |