simonw an hour ago

It's impossible to answer if you don't have a search tool, and three out of the five tested models didn't have a search tool.

xyzzy123 an hour ago

Thanks; I didn't spot that they disabled tools in the harness. Also, they don't give the models an "out" to express uncertainty, so the instructions force a guess.

As an aside, though, it's still funny that the two models WITH search also disagreed.