lacunary 4 hours ago

Presumably whatever the top model uses and then some, since the human can use the model.

I wonder if a model could score higher if it had a human at its disposal?

olmo23 an hour ago

With a human at its disposal, it could probably count the number of R's in strawberry!

In all seriousness though, adding capabilities should not normally reduce the effectiveness of a model (within reason: don't pollute the context window with millions of useless tools).

pishpash 2 hours ago

Maybe models should ask for human-in-the-loop input, as a matter of convention.

sinuhe69 an hour ago

A model that can ask questions or ask for help when in doubt is indeed a major feat. None of the current frontier models can do that.