Remix.run Logo
NoahZuniga 4 days ago

The 135 iq result is on Mensa Norway, while the offline test is 120. It seems probable that similar questions to the one in Mensa are in the training data, so it probably overestimates "general intelligence".

TrackerFF 4 days ago | parent | next [-]

Some iq / aptitude test sections are trivial for machines, like working memory. Wonder if those parts are just excluded? As the could really pull up the test scores.

cman1444 3 days ago | parent [-]

If they are excluded, we should be calling the score something other than just "IQ". I don't think we should be moving the goal posts for some testers (machines) just because they are significantly better at some types of questions than other testers (humans).

starchild3001 4 days ago | parent | prev [-]

If you focus on the year over year jump, not on absolute numbers, you realize that the improvement in public test isn't very different from the improvement in private test.