Remix.run Logo
dbreunig 4 days ago

Wrote about this about a month ago. I think it’s fascinating how they developed these prompts: https://www.dbreunig.com/2025/07/05/cat-facts-cause-context-...

dbreunig 4 days ago | parent [-]

A similar, fun case is where researchers inserted facts about the user (gender, age, sports fandom) and found alignment rules were inconsistently applied: https://www.dbreunig.com/2025/05/21/chatgpt-heard-about-eagl...

nyrikki 4 days ago | parent [-]

If you map LLM/LRMs to Norvig's Model based reflex agents, wouldn't this be expected behavior?