They must have really hand picked those results, gpt4 would have been full of annoying emojis as bullet points and emdashes.
GPT 4o ≠ GPT-4
Maybe they should train a model to give these models more useful names.