rytill 2 hours ago
LLMs are not "average text generation machines" once they have context. LLMs learn a distribution. The moment you start the prompt with "You are an interactive CLI tool that helps users with software engineering at the level of a veteran expert" you have biased the LLM such that the tokens it produces are from a very non-average part of the distribution it's modeling.
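
To make the conditioning point concrete, here is a rough toy sketch (not how any real LLM works internally). It assumes a made-up corpus of (context, next token) pairs; the labels "novice" and "expert", the tokens, and the helper names `marginal` and `conditional` are all hypothetical. It only illustrates that the distribution given a context can look very different from the "average" distribution over everything.

```python
from collections import Counter

# Hypothetical toy corpus of (context label, next token) pairs.
# A real model conditions on the full prompt text, not a single label.
CORPUS = [
    ("novice", "print"), ("novice", "print"), ("novice", "print"),
    ("novice", "goto"), ("novice", "goto"),
    ("novice", "assert"),
    ("expert", "assert"), ("expert", "assert"), ("expert", "assert"),
    ("expert", "print"),
]


def marginal(corpus):
    """P(token): the 'average' distribution, ignoring context."""
    counts = Counter(tok for _, tok in corpus)
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


def conditional(corpus, context):
    """P(token | context): the distribution once the context is fixed."""
    counts = Counter(tok for ctx, tok in corpus if ctx == context)
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


if __name__ == "__main__":
    print("average:     ", marginal(CORPUS))
    print("given expert:", conditional(CORPUS, "expert"))
```

In this toy data, "assert" has probability 0.4 on average but 0.75 once the context is "expert", and "goto" drops out entirely. The system prompt plays the role of the context label here, just with a vastly richer conditioning signal.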