adastra22 5 hours ago
> if it's not public, presumably LLMs would never get better at them.

Why? This is not obvious to me at all.
gregsadetsky 4 hours ago
You're correct, of course: LLMs may get better at any task. What I meant is that publishing the evals might (optimistically speaking) help LLMs get better at the task, provided the eval is actually picked up and used in the training loop.