odo1242 3 hours ago
Claude Opus 4.6 is the best possible model to use in this test, with the least sycophancy. OpenAI and Gemini models are bad in comparison.
mkozlows 3 hours ago | parent
ChatGPT thinking models are very good; the instant model is bad. Gemini is always desperate to find an answer, and will give you one no matter what.