| ▲ | mceachen 4 hours ago | |
This is a very recent model behavior change: for me, Opus 4.6, Gemini 3.1 Pro, and ChatGPT 5.4(ish) -- prior models and harnesses suffered much more from sycophancy. (I still prompt some questions and reviews with "our intern suggested..." to allow models to judge the quality of the content apart from the messenger) | ||