| ▲ | 100ms 4 hours ago | ||||||||||||||||
Tiny model overfit on benchmark published 3 years prior to its training. News at 10 | |||||||||||||||||
| ▲ | selimthegrim 3 hours ago | parent | next [-] | ||||||||||||||||
It wasn't important enough to make the 11 o'clock program. | |||||||||||||||||
| ▲ | bigyabai 4 hours ago | parent | prev | next [-] | ||||||||||||||||
But GPT-3.5 was benchmaxxing too. | |||||||||||||||||
| |||||||||||||||||
| ▲ | srslyTrying2hlp 4 hours ago | parent | prev [-] | ||||||||||||||||
[dead] | |||||||||||||||||