Remix.run Logo
hsn915 a day ago

I asked it a few questions and it responded exactly like all the other models do. Some of the questions were difficult / very specific, and it failed in the same way all the other models failed.

theptip 19 hours ago | parent [-]

Great example of this general class of reasoning failure.

“AI does badly on my test therefore it’s bad”.

The correct question to ask is, of course, what is it good at? (For bonus points, think in terms of $/task rather than simply being dominant over humans.)

atworkc 15 hours ago | parent [-]

"AI does badly on my test much like other AI's did before it, therefore I don't immediately see much improvement" is a fair assumption.

brookst 11 hours ago | parent | next [-]

No, it’s really not.

“I used an 8088 CPU to whisk egg whites, then an Intel core 9i-12000-vk4*, and they were equally mediocre meringues, therefore the latest Intel processor isn’t a significant improvement over one from 50 years ago”

* Bear with me, no idea their current naming

Kon-Peki 8 hours ago | parent [-]

You’re holding them wrong. An 8088 package should be able to emulate a whisk about a million times better than an i9.

theptip 9 hours ago | parent | prev [-]

“Human can’t fly, much like other humans. Therefore it’s bad”

Spot the problem now?

AI capabilities are highly jagged, they are clearly superhuman in many dimensions, and laughably bad compared to humans in others.