Remix clone Hacker News

new | show | ask | jobs Github

	▲	Workaccount2 5 days ago
		Over fitting isn't evidence of non-reasoning, but that aside, what's interesting is that ChatGPT (free) trips on this, as did older models. But GPT-5 thinking, Opus 4, and Gemini 2.5 Pro all pointed out that there is no trick and it's likely the man just views it as a conflict of interest to interview his son. It's hard to say whether this has been trained out (it's an old example) or if it's just another hurdle that general model progression has overcome.