Remix.run Logo
steve1977 3 hours ago

The question of course is, did it get the car wash question right because it is "the car wash question" or because it could actually infer why the car needed to be there?

embedding-shape 3 hours ago | parent | next [-]

Wasn't that "twoot" (or whatever Mastodon calls them) made just a week ago? Unlikely to have been in the training dataset of a model becoming available for public use today, unless Google made some serious advancements on the training front.

jama211 3 hours ago | parent | prev [-]

Shouldn’t be too hard to come up with a new unique reasoning question