[flagged]

CJefferson 2 years ago | parent | next [-]

We can, and do, choose to treat normal people differently from billion-dollar companies that are attempting to suck up all human output and turn it into their own personal profit.

If they were, say, a charity doing this for the good of mankind, I’d have more sympathy. Shame they never were.

▲

tolmasky 2 years ago | parent [-]

The way to treat them differently is not by making them share profits with another corporation. The logical endgame of all this isn’t “stopping LLMs,” it’s Disney happening to own a critical mass of IP to be able to legally train and run LLMs that make movies, firing all their employees, and no smaller company ever having a chance in hell of competing with a literal century’s worth of IP powering a generative model.

The best part about all this is that Disney initially took off by… making use of public domain works. Copyright used to last 14 years. You’d be able to create derivative works of most of the art in your life at some point. Now you’re never allowed to. And more often than not, not to grant a monopoly to the “author”, but to the corporation that hired them. The correct analysis shouldn’t be OpenAI vs. Intercept or Disney or whomever. You’re just choosing kings at that point.

▲

IsTom 2 years ago | parent | prev | next [-]

> produced "a unique" song?

People do get sued for making songs that are too similar to previously made songs. One defence available is that they've never heard it themselves before.

If you want to treat AI like humans then if AI output is similar enough to copyrighted material it should get sued. Then you try to prove that it didn't ingest the original version somehow.

▲

noitpmeder 2 years ago | parent [-]

The fact that these lawsuits aren't as simple as "is my copyrighted work in your training set, yes or no" is boggling.

▲

__loam 2 years ago | parent [-]

I feel like at some point the people in favor of this are going to realize that whether the data was ingested into a training set is completely immaterial to the fact that these companies downloaded data they don't have a license to use to a company server somewhere with the intention to use it for commercial use.

▲

GeoAtreides 2 years ago | parent | prev | next [-]

Ah yes, humans and LLMs are exactly the same, learning the same way, reasoning the same way, they're practically indistinguishable. So that's why it makes sense to equate humans reading books with computer programs ingesting and processing the equivalent of billions of books in literal days or months.

▲

Timwi 2 years ago | parent [-]

While I agree with your sentiment in general, this thread is about the legal situation and your argument is unfortunately not a legal one.

▲

anileated 2 years ago | parent [-]

“A person is fundamentally different from an LLM” does not need a legal argument and is implied by the fact that LLMs do not have human rights, or even anything comparable to animal rights.

A legal argument would be needed to argue the other way. This argument would imply granting LLMs some degree of human rights, which the very industry profiting from these copyright violations will never let happen for obvious reasons.

▲

notahacker 2 years ago | parent [-]

The other problem with the legal argument that it's "just like a person learning" is that corporations whose human employees have learned what copyrighted characters look like and then start incorporating them into their art are considered guilty of copyright violation, and don't get to deploy the "it's not an intentional copyright violation from someone who should have known better, it's just a tool outputting what the user requested" defence...

▲

anileated 2 years ago | parent [-]

Exactly. Also, it is only a matter of time until one of those employees (thanks to free will and agency) blows the whistle, it doesn’t scale, etc. Frankly, the fact that such a big segment of the HN crowd unthinkingly buys big tech’s double standard (LLMs are human when copyright is concerned, but not human in every other sense) makes me ashamed of the industry.

▲

mongol 2 years ago | parent | prev | next [-]

The process of reading it into their training data is a way of copying it. It exists somewhere and they need to copy it in order to ingest it.

▲

wvenable 2 years ago | parent [-]

By that logic you're violating copyright by using a web browser.

▲

Suppafly 2 years ago | parent | next [-]

>By that logic you're violating copyright by using a web browser.

You would be except for the fact that publishing stuff on the web gives people an implicit license to download it for the purposes of viewing it.

▲

Timwi 2 years ago | parent | next [-]

Not sure about US or other jurisdictions, but that's not how any of this works in Germany. In Germany downloading anything from anywhere (even a movie) is never illegal and does not require a license. What's illegal is publishing/disseminating copyrighted content without authorization. BitTorrenting a movie is illegal because you're distributing it to other torrenters. Streaming a movie on your website is illegal because it's public. You can be held liable for using a photo from the web to illustrate your eBay auction, not because you downloaded it but because you republished it.

OpenAI (and Google and everyone else) is creating a publicly-accessible system that produces output that could be derived from copyrighted material.

▲

Suppafly 2 years ago | parent | next [-]

I think it works like that in Canada and some other places too, because they pay an extra tax on storage media when they buy it, which essentially authorizes a license for any copyrighted material that might be stored on that media.

▲

Tomte 2 years ago | parent | prev [-]

> In Germany […]

That’s confidently and completely wrong.

▲

wvenable 2 years ago | parent | prev [-]

I'm only allowed to view it? I can't download it, convert each word into a color, and create a weird piece of art work out of it? I think I can.

▲

Suppafly 2 years ago | parent [-]

>convert each word into a color, and create a weird piece of art work out of it? I think I can.

I agree, but the original author might get butthurt if you distribute it. Realistically copyright law in the US is a mess when it comes to weird pieces of art.

▲

__loam 2 years ago | parent | prev [-]

The nature of the copy does actually matter.

▲

DrillShopper 2 years ago | parent | prev | next [-]

> You read books and now you have a job? Pay up.

It is disingenuous to imply the scale of someone buying books and reading them (for which the publisher and author are compensated) or borrowing them from the library and reading them (again, for which the publisher and author are compensated) is the same as the wholesale copying, without permission or payment, of anything not behind a paywall on the Internet.

▲

2 years ago | parent | prev [-]

[deleted]