Remix.run Logo
IcyWindows 2 hours ago

I'm trained on protected works. Do I need to pay royalties?

kube-system 2 hours ago | parent | next [-]

If you produce them verbatim or in significant enough portions, yes.

LatencyKills 2 hours ago | parent | prev [-]

> I'm trained on protected works.

That someone, at some point, paid for.

I'd like to understand why I can't use a song in one of my videos without permission/payment, but an AI company can train models using that song without having either.

I'm not anti-AI. I'd just like to see companies play by the rules everyone else has to follow.

echoangle an hour ago | parent | next [-]

> I'd like to understand why I can't use a song in one of my videos without permission/payment, but an AI company can train models using that song without having either.

Because training isn't redistribution.

You can also listen to the song and make a new one that sounds similar, just like the AI can.

LatencyKills an hour ago | parent [-]

To do that training, you must first obtain the item with the content you require. Did OpenAI purchase a copy of every book they trained their models on?

Answer: They did not. That is literally why there are dozens of ongoing lawsuits in progress.

echoangle an hour ago | parent [-]

For songs, it's not that hard to legally get access to it, I think. I'm not sure if Spotify can legally prevent you from using songs for AI training for example.

CamperBob2 2 hours ago | parent | prev | next [-]

I'd like to understand why I can't use a song in one of my videos without permission/payment, but an AI company can train models using that song without having either.

You're right, it's an unjust situation. And you may note that no one else besides the AI companies has made any progress at all towards changing it.

Copyright will soon die, having outlived its usefulness to society. Whether the knife is held by someone named Stallman or someone named Altman is of little consequence.

JimDabell 2 hours ago | parent | prev [-]

> I'd like to understand why I can't use a song in one of my videos without permission/payment, but an AI company can train models using that song without having either.

Because when you say you are “using” the song, what you mean is that you are distributing copies of the song, which is protected by copyright.

When AI companies train on the song, the model is learning from it. Outside of the rare cases of memorisation, this is not distributing copies and so copyright doesn’t have any say in the matter.

Learning isn’t copying, so copyright doesn’t get involved at all.

LatencyKills 2 hours ago | parent [-]

I appreciate your comment, but you answered as if this question had been answered legally. It has not.

The New York Times is suing both OpenAI and Microsoft for copyright infringement. The Authors Guild is suing OpenAI. Getty Images is suing Stability AI. Disney is suing Midjourney. Universal Music Group and Sony have filed suits against multiple AI companies.

> so copyright doesn’t get involved at all.

The dozens of ongoing cases that discredit that statement.

JimDabell an hour ago | parent [-]

Which statement of mine do you think is not settled law? Which law do you think is being broken and how?

Your objection doesn’t make sense. In the event that an AI company loses a lawsuit for copyright infringement based on simply training on copyrighted works, the answer to you saying you’d like to understand why they can do it and you can’t is simply “your premise is wrong; neither of you can”.

LatencyKills an hour ago | parent [-]

> Which statement of mine do you think is not settled law?

I object to your statement that "copyright doesn’t get involved at all" when that is objectively untrue. If that was true, many of the world's largest companies wouldn't be spending tens of millions of dollars to have that question answered in court. Go to any law-focused forum, and you will find attorneys arguing over these questions.

To train a model using a book, you must first obtain a copy of that book. Did OpenAI purchase a copy of every book not already in the public domain used during training? They did not.

Some of the suits I mentioned claim that OpenAI literally stole copies of books to train its models.

My point is that the copyright question has not been answered. If the NYT, et. al. win, it will be a watershed moment for how AI companies pay for training data moving forward.