| ▲ | cadamsdotcom 10 hours ago | |||||||
> An LLM's primary modality isn't smell. It's... text. But, specifically: text in the context of a user-assistant conversation in which it's trying to be helpful. Text is how they learned about everything they know, and the user-assistant chatlog is how they communicate everything they generate This is true for instruction-tuned models; but instruction tuning is late in the training process. A bit like assessing a person’s self-awareness based on their high-school knowledge. | ||||||||
| ▲ | thepasch 10 hours ago | parent [-] | |||||||
Very true, and something worth mentioning. Papers that tried eliciting introspective language from base models with no post-training have largely failed to find any patterns or activations that look similar to those found in instruct models when prompted for the same thing. I did sort of touch on it in the "what does this mean" section: > *post-training* installs a self-model with actual, meaningful boundaries, and when processing falls outside those boundaries, the first-person pronoun no longer binds to the content. But you're right I could've been more explicit about it. | ||||||||
| ||||||||