It would be interesting to see if that same question against GPT-5 Thinking produces notably better results.