ben_w 3 days ago

> Besides lots of GPUs, training data seems to be the most valuable asset AI companies have. That sounds like a strong incentive to use it secretly anyway. Who would really know, if the pipelines are set up so that only very few people are aware of this?

Could be, but it's a huge risk the moment any lawsuit happens and the "discovery" process starts. Or whistleblowers.

They may well take that risk; they're clearly risk-takers. But it is a risk.

yunwal 3 days ago

Eh, they’re all using copyrighted training data from torrent sites anyway. If the government was gonna hold them accountable for this, it would have happened already.

ragequittah 3 days ago

You're probably right [1]

[1] https://www.cbc.ca/news/business/anthropic-ai-copyright-sett...

ben_w 3 days ago

The piracy was found to be unlawful copyright infringement.

The training was OK, but the piracy wasn't, and they were held accountable for that.

blibble 3 days ago

the US no longer has any form of rule of law

so there's no risk

ben_w 3 days ago

The USA is a mess that's rapidly getting worse, but it has not yet fallen that far.

Aurornis 3 days ago

> the US no longer has any form of rule of law

AI threads really bring out the extreme hyperbole and doomerism.