Remix.run Logo
vova_hn2 a day ago

> I can distinguish my own thoughts from your speech without effort; they arrive through completely different channels with completely different sensory signatures. But for an LLM, everything arrives through the same channel as one long token soup. Its own thoughts sit next to your instructions, which sit next to the contents of a random webpage it just fetched.

I was thinking about the original encoder-decoder transformers, that did have separate channels for input and their own output.

Why can't we bring it back? For example, one channel for system prompt and another for everything else.