Remix.run Logo
godelski 20 hours ago

There is already exemptions for research. Look at licensing around things like ImageNet. There's similar licensing around things like LAION and Common Crawl[0] It's also not legal to just scrape everything without paying. There's a reason the NYT sued OpenAI and then got a settlement. It's still illegal for Meta to torrent terabytes of textbooks too.

[0] https://commoncrawl.org/terms-of-use

  > In this regard, you acknowledge that you may not rely on any Crawled Content created or accumulated by CC.  CC strongly recommends that you obtain the advice of legal counsel before making any use, including commercial use, of the Service and/or the Crawled Content.  BY USING THE CRAWLED CONTENT, YOU AGREE TO RESPECT THE COPYRIGHTS AND OTHER APPLICABLE RIGHTS OF THIRD PARTIES IN AND TO THE MATERIAL CONTAINED THEREIN.