▲ | eqvinox 4 days ago | |
> While on the Mensa Norway test GPT-5 gets over 一四, on an offline test it goes down to ~一二. Since IQ tests are fundamentally timed, those numbers are meaningless to compare with human numbers. Or maybe dangerous since it's hard to de-context them even if you know that. Hence my cheeky 漢字. (Yes they might be useful to compare LLMs with each other, but that is outstripped by the risk of misreading it against what we know as "IQ".) |