| ▲ | lmf4lol 18 hours ago | |||||||||||||||||||||||||||||||
I really dont want to be cynic but those guys gave a flying f””” about copyright while scraping the whole internet. How can I ever trust them to respect the oot-out setting. I cant. Thieves be thieves. And even if they dont train on the data. Who guarantees us, they dont let another AI model analyse all the data, exfiltrating all kinds of intelligence and using it? I only can imagine what OpenAI and Anthropic know…. | ||||||||||||||||||||||||||||||||
| ▲ | astrange 17 hours ago | parent [-] | |||||||||||||||||||||||||||||||
Scraping the internet isn't a copyright violation. Using it for LLM training is much more transformative than Google and Internet Archive, which are legal. | ||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||