culi 2 hours ago

The point here is to test for "genuine reasoning" or something approaching it. If a model is truly reasoning, it should be competent even in a new language you just made up (provided the language itself is competently designed).

wehnsdaefflae 2 hours ago

So humans don't do "genuine reasoning"?