Remix.run Logo
stale2002 5 hours ago

ok! So if someone uses an existing, checkpointed, open source model then the answer is yes the results are valid and it doesn't matter that the tests are public.

measurablefunc 5 hours ago | parent [-]

Yes, assuming the checkpoint was before the announcement & public availability of the test set.