| ▲ | Oras 6 hours ago | |
Hard time? What value do adult video descriptions, views, and comments add to small (7B, 32B) models? | ||
| ▲ | andy99 6 hours ago | parent | next [-] | |
It says it’s Common Crawl. I interpret that to mean this is a generic web-scrape dataset; presumably they filter out the stuff they don’t want before pretraining. You’d have to do some ablation testing to know what value it adds. | ||
| ▲ | khimaros 4 hours ago | parent | prev [-] | |
what if that's where they learned how to utilize the double entendre? hard times indeed. | ||