| ▲ | red75prime 8 months ago | |
LLMs are doing what you train them to do. See for example " The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions " by Eric Wallace et al. | ||
| ▲ | MattPalmer1086 8 months ago | parent [-] | |
Interesting. Doesn't solve the problem entirely but seems to be a viable strategy to mitigate it somewhat. | ||