Wrote about this about a month ago. I think it’s fascinating how they developed these prompts: https://www.dbreunig.com/2025/07/05/cat-facts-cause-context-...

A similar, fun case is where researchers inserted facts about the user (gender, age, sports fandom) and found alignment rules were inconsistently applied: https://www.dbreunig.com/2025/05/21/chatgpt-heard-about-eagl...

	▲	nyrikki 4 days ago \| parent [-]
		If you map LLM/LRMs to Norvig's Model based reflex agents, wouldn't this be expected behavior?