TeMPOraL 3 days ago
Scraping was a thing before LLMs; there's a whole separate arms race around it for regular competition and "industrial espionage" reasons. I'm not really sure why model training would become a noticeable fraction of scraping activity: there are only a few players on the planet that can afford to train decent LLMs in the first place, and they're not going to re-scrape the content they already have ad infinitum.
int_19h 3 days ago
> they're not going to re-scrape the content they already have

That's true for static content, but much of it is forums and similar places where the main value is that new content is constantly being generated, and that content has to be re-scraped.