Remix.run Logo
Joel_Mckay 8 hours ago

The LLM activation capping only reduces aberrant offshoots from the expected reasoning models behavioral vector.

Thus, the hidden agent problem may still emerge, and is still exploitable within the instancing frequency of isomorphic plagiarism slop content. Indeed, LLM can be guided to try anything people ask, and or generate random nonsense content with a sycophantic tone. =3