Remix.run Logo
UltraSane 4 hours ago

Even people benefit from multiple tries over time.

chris_st 4 hours ago | parent [-]

Well, to be fair, people cheat by remembering what they did last time. I think the idea here is to run the models from a "clean slate" and see how often they succeed/fail.

They are, like people, non-deterministic, so giving them several "fair" trials makes sense to me.