| ▲ | parineum 3 days ago | ||||||||||||||||
Everything is just trivia until you have a use for the answer. OP provided a we link with the answer, aren't these models supposed to be trained on all of that data? | |||||||||||||||||
| ▲ | esafak 3 days ago | parent | next [-] | ||||||||||||||||
There is nothing useful you can do with this information. You might as well memorize the phone book. The model has a certain capacity -- quite limited in this case -- so there is an opportunity cost in learning one thing over another. That's why it is important to train on quality data; things you can build on top of. | |||||||||||||||||
| |||||||||||||||||
| ▲ | DennisP 3 days ago | parent | prev [-] | ||||||||||||||||
Just because it's in the training data doesn't mean the model can remember it. The parameters total 60 gigabytes, there's only so much trivia that can fit in there so it has to do lossy compression. | |||||||||||||||||