Remix.run Logo
The State of GPL Propagation to AI Models(shujisado.org)
85 points by jonymo 3 hours ago | 96 comments
Orygin 34 minutes ago | parent | next [-]

Great article but I don't really agree with their take on GPL regarding this paragraph:

> The spirit of the GPL is to promote the free sharing and development of software [...] the reality is that they are proceeding in a different vector from the direction of code sharing idealized by GPL. If only the theory of GPL propagation to models walks alone, in reality, only data exclusion and closing off to avoid litigation risks will progress, and there is a fear that it will not lead to the expansion of free software culture.

The spirit of the GPL is the freedom of the user, not the code being freely shared. The virality is a byproduct to ensure the software is not stolen from their users. If you just want your code to be shared and used without restrictions, use MIT or some other license.

> What is important is how to realize the “freedom of software,” which is the philosophy of open source

Freedom of software means nothing. Freedoms are for humans not immaterial code. Users get the freedom to enjoy the software how they like. Washing the code through an AI to purge it from its license goes against the open source philosophy. (I know this may be a mistranslation, but it goes in the same direction as the rest of the article).

I also don't agree with the arguments that since a lot of things are included in the model, the GPL code is only a small part of the whole, and that means it's okay. Well if I take 1 GPL function and include it in my project, no matter its size, I would have to license as GPL. Where is the line? Why would my software which only contains a single function not be fair use?

palata 38 minutes ago | parent | prev | next [-]

Genuine question: if I train my model with copyleft material, how do you prove I did?

Like if there is no way to trace it back to the original material, does it make sense to regulate it? Not that I like the idea, just wondering.

I have been thinking for a while that LLMs are copyright-laundering machines, and I am not sure if there is anything we can do about it other than accepting that it fundamentally changes what copyright is. Should I keep open sourcing my code now that the licence doesn't matter anymore? Is it worth writing blog posts now that it will just feed the LLMs that people use? etc.

freedomben 12 minutes ago | parent | next [-]

I've thought about this as well, especially for the case when it's a company owned product that is AGPLed. It's a really tough situation, because the last thing we want is competitors to come in and LLM wash our code to benefit their own product. I think this is a real risk.

On the other side, I deeply believe in the values of free software. My general stance is that all applications I open source are GPL or AGPL, and any libraries I open source are MIT. For the libraries, obviously anyone is free to use them, and if they want to rewrite them with an LLM more power to them. For the applications though, I see that as a violation of the license.

At the end of the day, I have competing values and needs and have to make a choice. The choice I've made for now is that for the vast majority of things, I'm still open sourcing them. The gift to humanity and the guarantee to the users freedom is more important to me than a theoretical threat. The one exception is anything that is truly a risk of getting lifted and used directly by competitors. I have not figured out an answer to this one yet, so for now I'm keeping it AGPL but not publicly distributing the code. I obviously still make the full code available to customers, and at least for now I've decided to trust my customers.

I think this is an issue we have to take week by week. I don't want to let fear of things cause us to make suboptimal decisions now. When there's an actual event that causes a reevaluation, I'll go from there.

basilgohar 29 minutes ago | parent | prev | next [-]

Maybe we should requiring training data be published or at least referenced.

mistrial9 29 minutes ago | parent | prev [-]

> Should I keep open sourcing my code now that the licence doesn't matter anymore?

your LICENSE matters in similar ways that it mattered before LLMs. LICENSE adherence is part of intellectual property law and practice. A popular engine may be popular, but not all cases at all times. Do not despair!

graemep 2 hours ago | parent | prev | next [-]

The article repeatedly treats license and contract as though they are the same, even though the sidebar links to a post that discusses the difference.

A lot of it boils down to whether training an LLM is a breach of copyright of the training materials which is not specific to GPL or open source.

maxloh an hour ago | parent | next [-]

To my understanding, if the material is publicly available or obtained legally (i.e., not pirated), then training a model with it falls under fair use.

Once training is established as fair use, it doesn't really matter if the license is MIT, GPL, or a proprietary one.

blibble an hour ago | parent | next [-]

fair use only applies in the united states (and Poland, and a very limited set of others)

https://en.wikipedia.org/wiki/Fair_use#/media/File:Fair_use_...

and it is certainly not part of the Berne Convention

in almost every country in the world even timeshifting using your VCR and ripping your own CDs is copyright infringement

jcelerier an hour ago | parent | next [-]

France and most of europe has fair use (https://fr.wikipedia.org/wiki/Copie_priv%C3%A9e) but also has a mandatory tax on every sold medium that can do storage to recover the "lost fees" due to fair use

RobotToaster 27 minutes ago | parent | prev [-]

Most commonwealth countries have fair dealing, which is similar although slightly different https://en.wikipedia.org/wiki/Fair_dealing

blibble 16 minutes ago | parent [-]

importantly "fair dealing" has no concept of "transformation"

(which is the linch-pin of the sloppers)

mongol an hour ago | parent | prev | next [-]

> To my understanding, if the material is publicly available or obtained legally (i.e., not pirated), then training a model with it falls under fair use.

Is this legally settled?

graemep an hour ago | parent | prev [-]

That is just the sort of point I am trying to make. That is a copyright law issue, not a contractual one. If the GPL is a contract then you are in breach of contract regardless of fair use or equivalents.

xgulfie 2 hours ago | parent | prev | next [-]

And the current norm that the trillion dollar companies have lobbied for is that you can train on copyrighted material all you want so that's the reality we are living in. Everything ever published is all theirs.

graemep an hour ago | parent | next [-]

I am really surprised that media businesses, which are extremely influential around the world, have not pushed back against this more. I wonder whether they are looking at cost savings that will get from the technology as a worthwhile trade-off.

gorbachev an hour ago | parent [-]

They're busy trying to profit from it rushing to enter into licensing agreements with the LLM vendors.

rileymat2 31 minutes ago | parent | prev | next [-]

All theirs, if they properly obtained the copy.

This is a big difference that already has bit them.

exasperaited an hour ago | parent | prev [-]

In practice it wouldn't matter a whit if they lobbied for it or not.

Lobbying is for people trying to stop them; externalities are for the little people.

kronicum2025 an hour ago | parent | prev | next [-]

A GPL license is a contract in most other countries. Just not US probably.

graemep 38 minutes ago | parent [-]

That part of the article is about US cases, so its US law that applies.

> A GPL license is a contract in most other countries. Just not US probably.

Not just the US. It may vary with version of the GPL too. Wikipedia claims its a civil law vs common law country difference - not sure the citation shows that though.

OneDeuxTriSeiGo an hour ago | parent | prev [-]

It's not specific to open source but it's most clearly enforceable with open source as there will be many contributors from many jurisdictions with the one unifying factor being they all made their copyright available under the same license terms.

With proprietary or more importantly single-owner code, it's far easier for this to end up in a settlement rather than being drug out into an actual ruling, enforcement action, and establishment of precedence.

That's the key detail. It's not specific to GPL or open source but if you want to see these orgs held to account and some precedence established, focusing on GPL and FOSS licensed code is the clearest path to that.

zamadatix 2 hours ago | parent | prev | next [-]

The article goes deep into these two cases deemed most relevant but really there are a wide swath of similar cases all focused around defining sharper borders than ever around what is essentially the question "exactly when does it become copyright violation" with plenty of seemingly "obvious" answers which quickly conflict with each other.

I also have the feeling it will be much like Google LLC v. Oracle America, Inc., much of this won't really be clearly resolved until the end if the decade. I'd also not ve surprised if seemingly very different answers ended up bubbling up in the different cases, driven by the specifics of the domain.

Not a lawyer, just excited to see the outcomes :).

twoodfin an hour ago | parent [-]

Ideally, Congress would just settle this basket of copyright concerns, as they explicitly have the power to do—and have done so repeatedly in the specific context of computers and software.

jeremyjh 42 minutes ago | parent [-]

What is ideal about getting more shitty laws written at the behest of massive tech companies? Do you think the DMCA is a good thing?

twoodfin 28 minutes ago | parent [-]

As opposed to waiting for uncertain court cases (based on the existing shitty laws) to play out for years, ultimately decided by unelected judges?

Democracy is the worst system we’ve tried, except for all the others.

(Also: The GPL can only be enforced because of laws passed by Congress in the late ‘70’s and early ‘80’s. And believe you me, people said all the same kinds of things about those clowns in Congress. Plus ça change…)

jeremyjh 10 minutes ago | parent [-]

Courts applying legal analysis to existing law and precedent is also an operation of democracy in action and lately they've been a lot better at it than legislators. I don't know if you've noticed, but the quality of our legislators has substantially deteriorated since the 80s, when 24-hour news networks became a thing. It got even worse after the Citizens United decision and social media became a thing. "No new laws" is really the safest path these days.

myrmidon an hour ago | parent | prev | next [-]

I honestly think that the most extreme take that "any output of an LLM falls under all the copyright of all its training data" is not really defensible, especially when contrasted with human learning, and would be curious to hear conflicting opinions.

My view is that copyright in general is a pretty abstract and artificial concept; thus corresponding regulation needs to justifiy itself by being useful, i.e. encouraging and rewarding content creation.

/sidenote: Copyright as-is barely holds up there; I would argue that nobody (not even old established companies) is significantly encouraged or incentivised by potential revenue more than 20 years in the future (much less current copyright durations). The system also leads to bad ressource allocation, with almost all the rewards ending up at a small handful of most successful producers-- this effectively externalizes large portions of the cost of "raising" artists.

I view AI overlap under the same lense-- if current copyright rules would lead to undesirable outcomes (by making all AI training or use illegal/infeasible) then law/interpretation simply has to be changed.

jeremyjh 13 minutes ago | parent [-]

Anyone can very easily avoid training on GPL code. Yes, the model might be not be as strong as one that is trained that way and released under terms of the GPL, but to me that sounds like quite a good outcome if the best models are open source/open weight.

Its all about whose outcomes are optimized.

Of course, the law generally favors consideration of the outcomes for the massive corporations donating hundreds of millions of dollars to legislature campaigns.

phplovesong an hour ago | parent | prev | next [-]

We need a new license that forbids all training. That is the only way to stop big corporations from doing this.

maxloh an hour ago | parent | next [-]

To my understanding, if the material is publicly available or obtained legally (i.e., not pirated), then training a model with it falls under fair use, at least in the US and some other jurisdictions.

If the training is established as fair use, the underlying license doesn't really matter. The term you added would likely be void or deemed unenforceable if someone ever brought it to a court.

rileymat2 35 minutes ago | parent | next [-]

It depends on the license terms, if you have a license that allowed you to get it legally where you agreed to those terms it would not be legal for that purpose.

But this is all grey area… https://www.authorsalliance.org/2023/02/23/fair-use-week-202...

justin_murray an hour ago | parent | prev | next [-]

This is at least murky, since a lot of pirated material is “publicly available”. Certainly some has ended up in the training data.

michaelmrose 38 minutes ago | parent [-]

It isn't? You have to break the law to get it. It's publicly available like your TV is if I were to break into your house and avoid getting shot.

basilgohar 27 minutes ago | parent | next [-]

That isn't even remotely a sensible analogy. Equating copyright violation with stealing physical property is an extremely failed metaphor.

MangoToupe 27 minutes ago | parent | prev [-]

Maybe you have some legalistic point that escapes comprehension, but I certainly consider my house to be much private and the internet public.

colechristensen 43 minutes ago | parent | prev [-]

I wouldn't say this is settled law, but it looks like this is one of the likely outcomes. It might not be possible to write a license to prevent training.

munchler 43 minutes ago | parent | prev | next [-]

By that logic, humans would also be prevented from “training” on (i.e. learning from) such code. Hard to see how this could be a valid license.

psychoslave 37 minutes ago | parent | next [-]

Isn’t it the very reason why we need cleanroom software engineering:

https://en.wikipedia.org/wiki/Cleanroom_software_engineering

codedokode 27 minutes ago | parent | prev [-]

Bad analogy, probably made up by capitalists to confuse people. ML models cannot and do not learn. "learning" is a name of a process, when model developer downloads pirated material and processes it with an algorithm (computes parameters from it).

Also, humans do not need to read million of pirated books to learn to talk. And a human artist doesn't need to steal million pictures to learn to draw.

WithinReason an hour ago | parent | prev | next [-]

Wouldn't it be still legal to train on the data due to fair use?

gus_massa an hour ago | parent [-]

I don't think it's fair use, but everyone on Earth disagree with me. So even with the standard default licence that prohibits absolutely everything, the humanity-1 consider it fair use.

justin_murray 36 minutes ago | parent [-]

Honest question: why don’t you think it is fair use?

I can see how it pushes the boundary, but I can’t lay out logic that it’s not. The code has been publish for the public to see. I’m always allowed to read it, remember it, tell my friends about it. Certainly, this is what the author hoped I would do. Otherwise, wouldn’t they have kept it to themselves?

These agents are just doing a more sophisticated, faster version of that same act.

mixedbit 24 minutes ago | parent [-]

Before LLMs programmers had pretty good intuition what GPL license allowed for. It is of course clear that you cannot release a closed source program with GPL code integrated into it. I think it was also quite clear, that you cannot legally incorporate GPL code into such a program, by making changes here and there, renaming some stuff, and moving things around, but this is pretty much what LLMs are doing. When humans do it intentionally, it is violation of the license, when it is automated and done on a huge scale, is it really fair use?

James_K an hour ago | parent | prev | next [-]

Would such a license fall under the definition of free software? Difficult to say. Counter-proposition: a license which permits training if the model is fully open.

amszmidt an hour ago | parent | next [-]

It isn't the difficult, a license that forbids how the program is used is a non-free software license.

"The freedom to run the program as you wish, for any purpose (freedom 0)."

Orygin 28 minutes ago | parent | next [-]

Yet the GPL imposes requirements for me and we consider it free software.

You are still free to train on the licensed work, BUT you must meet the requirements (just like the GPL), which would include making the model open source/weight.

helterskelter 32 minutes ago | parent | prev [-]

Running the program and analyzing the source code are two different things...?

amszmidt 8 minutes ago | parent [-]

In the context of Free Software, yes. Freedom one is about the right to study a program.

Orygin an hour ago | parent | prev | next [-]

My next project will be released under a GPL-like license with exactly this condition added. If you train a model on this code, the model must be open source & open weights

fouronnes3 an hour ago | parent [-]

Not sure why the FSF or any other organization hasn't released a license like this years ago already.

amszmidt an hour ago | parent [-]

Because it would violate freedom zero. Adding such terms to the GNU GPL would also mean that you can remove them, they would be considered "further restrictions" and can be removed (see section 7 of the GNU GPL version 3).

Orygin 31 minutes ago | parent [-]

Freedom 0 is not violated. GPL includes restrictions for how you can use the software, yet it's still open source.

You can do whatever you want with the software, BUT you must do a few things. For GPL it's keeping the license, distributing the source, etc. Why can't we have a different license with the same kind of restrictions, but also "Models trained on this licensed work must be open source".

Edit: Plus the license would not be "GPL+restriction" but a new license altogether, which includes the requirements for models to be open.

amszmidt 9 minutes ago | parent [-]

That is not really correct, the GNU GPL doesn't have any terms whatsoever on how you can use, or modify the program to do things. You're free to make a GNU GPL program do anything (i.e., use).

I suggest a careful reading of the GNU GPL, or the definition of Free Software, where this is carefully explained.

tomrod an hour ago | parent | prev [-]

Model weights, source, and output.

scotty79 an hour ago | parent | prev [-]

We need a ruling that LLM generated code enters public domain automatically and can't be covered by any license.

palata 43 minutes ago | parent | next [-]

But then we would need a way to prove that some code was LLM generated, right?

Like if I copy-paste GPL-licenced code, the way you realise that I copy-pasted it is because 1) you can see it and 2) the GPL-licenced code exists. But when code is LLM generated, it is "new". If I claim I wrote it, how would you oppose that?

michaelmrose 36 minutes ago | parent | prev [-]

Laws exist to protect those who make and have money. If trillions could be made harvesting your kids kidneys it would be legal.

basilgohar 22 minutes ago | parent [-]

It's done extrajudicially in warzones such as Palestine where hostages are returned from Israeli jails, with missing organs, dead or alive [0].

[0] https://factually.co/fact-checks/justice/evidence-investigat...

ljlolel an hour ago | parent | prev | next [-]

And then also to all code made from the GPL’d ai model?

maxloh an hour ago | parent [-]

A program's output is likely not owned by the program's authors. For example, if you create a document with Microsoft Word, you are the one who owns it, not Microsoft.

javcasas 18 minutes ago | parent | next [-]

You sure about that? Have you checked the 400-page EULA?

pessimizer an hour ago | parent | prev [-]

Unless the license says otherwise. The fact that Word doesn't (I wouldn't even be sure if that was true, honestly, especially for the online versions) doesn't mean anything.

They could start selling a version of Word tomorrow that gives them the right to train from everything you type on your entire computer into any program. Or that requires you to relinquish your rights to your writing and to license it back from Microsoft, and to only be able to dispute this through arbitration. They could add a morals clause.

pessimizer an hour ago | parent | prev | next [-]

I might be crazy, and I'd love to hear from somebody who knows about this, but I've been assuming that AI companies have been pulling GPL code out of the training material specifically to avoid this.

Corporations have always talked about the virality of GPL, sometimes but not always to the point of exaggeration, you'd think that after getting the proof of concept done the AI companies would be running away at full speed from setting a bomb like that in their goldmine.

Putting in tons of commonly read books and scientific papers is safer, they can just eventually cross-license with the massive conglomerates that own everything. But the GPL is by nature hostile, and has been openly and specifically hostile from the beginning. MIT and Apache, etc. you can just include a fistful of licenses to download, or even come up with architectures that track names to add for attribution-ware. But the GPL will obviously (and legitimately) claim to have relicensed the entire model and maybe all its output (unless they restricted it to LGPL.)

Wouldn't you just pull it out?

NateEag an hour ago | parent | next [-]

If you were a thoughtful, careful, law-abiding business, yes.

I submit the evidence suggests the genAI companies have none of those attributes.

NiloCK an hour ago | parent | prev | next [-]

Not crazy - there's a rational self-interest in doing this.

But I'm not certain that the relevant players have the same consequence-fearing mindset that you do, and to be honest they're probably right. The theft is too great to calculate the consequences, and by the time it's settled, what are you gonna do - turn off Forster's machine?

I hope you're right in at least some cases!

pessimizer 27 minutes ago | parent [-]

> by the time it's settled

Why would the GPL settle? Even more, who is authorized to settle for every author who used the GPL? If the courts decided in favor of the GPL, which I think would be likely just because of the age and pervasiveness of the GPL, they'd actually have to lobby Congress to write an exception to copyright rules for AI.

A large part of the infrastructure of the world is built on the GPL, and the people who wrote it were clearly motivated by the protection that they thought that the GPL would give to what was often a charitable act, or even an act that would allow companies to share code without having to compete with themselves. I can't imagine too many judges just going "nope."

exasperaited an hour ago | parent | prev [-]

> I might be crazy, and I'd love to hear from somebody who knows about this, but I've been assuming that AI companies have been pulling GPL code out of the training material specifically to avoid this.

Haha no.

https://windsurf.com/blog/copilot-trains-on-gpl-codeium-does...

And just in the last two days, AI generating LGPL headers (which it could not do if identifying LGPL code was pulled from the codebase) and misattributing authors:

https://devclass.com/2025/11/27/ocaml-maintainers-reject-mas...

dmezzetti an hour ago | parent | prev | next [-]

As someone who has spent a fair amount of time developing open source software, I will say I genuinely dislike copyleft and GPL.

For those who are into freedom, I don't see how dictating how you use what you build in such a manner is in the spirit of free and open.

Just my opinion on it, to each their own on the matter.

myrmidon an hour ago | parent | next [-]

I had a very similar view once, and have since understood that this is mainly a difference in perspective:

It's easy as a developer to slip into a role where you want to build/package (maybe sell) some software product with minimal obligations. BSD-likes are obviously great there.

But the GPL follows a different perspective: It tries to make sure that every user of any software product is always capable of tinkering and changing it himself, and the more permissive licenses do not help there because they don't prevent (or even discourage!) companies from just selling you stripped and obfuscated binary blobs that put you fully at the vendors mercy.

dmezzetti 42 minutes ago | parent [-]

I understand people want to control what happens once they build something. Too often do you see startups go with a permissive model only to go to a more restrictive model once something like that happens. Then it ends up upsetting a lot of people.

I'm of the opinion that what I build, I'm willing to share it and let others use it as they see fit even if it's not to my advantage.

myrmidon 20 minutes ago | parent [-]

I think the GPL mainly suffers with startups because it makes monetization pretty difficult. Some "commercial" uses of it are also giving it somewhat of an undeserved bad taste (when companies use it to benefit from free contributions while preventing competitors from getting any use out of it).

My view is that every project and library where I can peruse the source is a gift/privilege. GPL restrictions I view as a small price to "pay it forward", and to keep that privilege for all wherever possible.

dmezzetti 15 minutes ago | parent [-]

Fair enough. You'd like to hope that there is a voluntary "pay it back and forward" mentality. But I understand that is a leap of faith with a lot of blind trust.

amenhotep an hour ago | parent | prev | next [-]

It's not dictating how you use what you build? It's dictating how you redistribute what you build on top of other people's work.

dmezzetti an hour ago | parent [-]

Ok but I just have no interest in imposing restrictions on how people distribute what I build in such a manner either. That's just me.

cdelsolar an hour ago | parent | prev | next [-]

I disagree as someone who has also spent a huge amount of time on open source software. It’s all GPL or AGPL :)

dmezzetti an hour ago | parent [-]

That's your prerogative. It's just not for me and GPL is basically something I avoid when possible.

pessimizer an hour ago | parent | prev [-]

As somebody who thinks that people currently own the code that they write, I wonder why you're in people's business who want to write GPL'd software.

Are you complaining about proprietary software? I hear the restrictions are a lot tighter for Photoshop's source code, or iOS's, but for some reason you are one of the people who hate GPL as a hobby. Please don't show up whining about "spirits" when Amazon puts you out of business.

dmezzetti an hour ago | parent [-]

I'm not in anyone's business just sharing my opinion on GPL. I understand why people go GPL / AGPL just not for me. To each their own if they want to go down that path.

simgt 2 hours ago | parent | prev | next [-]

What triggers me is how insistant Claude Code is on adding "co-authored by Claude" in commits, in spite of my settings and an instruction in CLAUDE.md. I wish all these tech bros were as willing to credit the human shoulders on which their products are built. But they'd be much less successful in our current system if they were that kind of people.

euazOn an hour ago | parent [-]

Try changing the system prompt or switch to opencode [0] - they allegedly reverse engineered Claude Code, and so the performance you get with Claude models should be very similar to Claude Code.

[0] https://github.com/sst/opencode

simgt an hour ago | parent | next [-]

I've changed the settings and added the instruction to the prompt, hence my frustration :)

patrick91 an hour ago | parent | prev [-]

there's an option for claude to disable co-authoring, see: https://code.claude.com/docs/en/settings

{ "includeCoAuthoredBy": false }

rvnx 2 hours ago | parent | prev | next [-]

GPL and copyright in general don't apply to billionaires, so pretty much a non-topic.

It's just a side cost of doing business, because asking for forgiveness is cheaper and faster than asking for permission.

throwaway198846 an hour ago | parent | next [-]

"Information wants to be free"? Many individuals pirated movies and games and got away with it. Of course two wrongs don't make a right and all that. Nonetheless one should be compensated for creating material that ai trained on for the same reasons copyright is compensated - to incentives people to produce it.

rando77 an hour ago | parent | prev [-]

With an attitude like that they don't

pclmulqdq 2 hours ago | parent | prev [-]

I thought the whole concept of a viral license was legally questionable to begin with. There haven't been cases about this, as far as I know, and GPL virality enforcement has just been done by the community.

omnicognate an hour ago | parent | next [-]

The GPL was tested in court as early as 2006 [1] and plenty of times since. There are no serious doubts about its enforceability.

[1] https://www.fsf.org/news/wallace-vs-fsf

pclmulqdq an hour ago | parent | next [-]

That case has little to do with the license itself and nothing to do with its virality.

omnicognate 41 minutes ago | parent [-]

As I said, that was merely the first of many. And there is no such thing as "virality" - see my answer to the sibling to your comment.

The "enforceability" of the GPL was never in any doubt because it's not a contract and doesn't need to be "enforced". The license grants you freedoms you otherwise may not have under copyright. It doesn't deny you any freedoms you would otherwise have, and it cannot do so because it is not a contract. If the terms of the GPL don't apply to your use then all you have is the normal freedoms under copyright law, which may prohibit it. If so, any "enforcement" isn't enforcement of the GPL. It's enforcement of copyright, and there's certainly no doubt on the enforceability of that.

For the GPL to "fail" in court it would have be found to effectively grant greater freedoms than it was designed to do (or less, resulting in some use not being allowed when it should be, but that's not the sort of case being considered here). It doesn't, and it has repeatedly stood up in court as not granting additional freedoms than were intended.

zamadatix an hour ago | parent | prev [-]

I know it's not popular on HN to have anything but supportive statements around GPL, and I'm a big GPL supporter myself, but there is nuance in what is being said here.

That case was important, but it's not abojt the virality. There have been no concluded court cases involving the virality portion causing the rest of the code to also be GPL'd, but there are plenty involving enforcement of GPL on the GPL code itself.

The distinction is important because the article is about the virality causing the whole LLM model to be GPL'd, not just about the GPL'd code itself.

I'd like to think it wouldn't be a problem to enforce, but I've also never seen a court ruling truly about the virality portion to back that up either - which is all GP is saying.

omnicognate an hour ago | parent [-]

There is no "virality", and the article's use of "propagation" to mean the same thing is wrong. The GPL doesn't "cause" anything to be GPLed that hasn't been explicitly licensed under the GPL by the owner of its copyright. The GPL grants a license to use the copyright material to which it applies. To satisfy the terms of that license for a particular use may require that you license other code under the GPL, but if you don't the GPL can't magically make that code GPLed. You will, however, not be covered by the license so unless your use is permitted for some other reason (eg. fair use or a different license you have been granted) your use of the the original code will be a violation of copyright. All of this has been repeatedly tested in court.

It's sad to see Microsoft's FUD still festering 20 years later.

pessimizer 41 minutes ago | parent [-]

It's not Microsoft FUD, you're describing the license as viral too, but playing with words. The fact is that if you include GPL'd stuff in your stuff, that assemblage has to conform to the GPL's rules.

You're basically saying "the GPL doesn't go back in time and relicense unrelated code." But nobody was ever claiming it does, and describing it as "viral" doesn't imply that it does. It's "viral" because code that you stick to it has to conform to its rules. It's good that the GPL is viral. I want it to be viral, I don't want people to be able to hide GPL'd code in a proprietary structure.

CamouflagedKiwi an hour ago | parent | prev | next [-]

There have been a number of of cases, which are linked from Wikipedia (https://en.wikipedia.org/wiki/GNU_General_Public_License#Leg...) - most recently Entr’Ouvert v. Orange had a strong judgement (under French law) in favour of the GPL.

Conversely, to my knowledge there has been no court decision that indicates that the GPL is _not_ enforceable. I think you might want to be more familiar with the area before you decide if it's legally questionable or not.

pclmulqdq an hour ago | parent [-]

I'm not suggesting that you avoid following it. I'm just not that convinced it's enforceable in the US. The French ruling is good, though.

iso1631 an hour ago | parent | prev [-]

If you don't like the license, then don't accept it.

You are then restricted by copyright just like with any other creation.

If I include the source code of Windows into my product, I can't simply choose to re-license it to say public domain and give it to someone else, the license that I have from Microsoft to allow me to use their code won't let me - it provides restrictions. It's just as "viral" as the GPL.

pclmulqdq an hour ago | parent [-]

I like the GPL. I just don't know how much you can actually enforce it.

Also, "don't use my code" is not viral. If you break the MSFT license, you pay them, which is a very well-tested path in courts. The idea of forced public disclosure does not seem to be.