terminalshort 3 days ago

Where is your evidence that Meta, Google, and OpenAI did the same thing? (As for NVIDIA, do they even train models?) Because if they did, why haven't they been sued? This is a garden variety copyright infringement case and would be a slam dunk win for the plaintiffs. The only novel part of the case is the claim that the plaintiffs lost on, which establishes president that training an LLM is fair use.

> Clearly something should be done because it's not like these companies can't afford the cost of the books

Yes indeed it should, and it has. They have been forced to pay $3000 per book they pirated, which is more than 100x what they would have gained if they had gotten away with it.

IMO a fine of 100x the value of a copy of the pirated work is more than sufficient as a punishment for piracy. If you want to argue that the penalty should be more, you can do that, but it is completely missing my point. You are talking about what is fair punishment to the companies, and my comment was talking about what is fair compensation to the authors. Those are two completely different things.

jimmydorry 3 days ago

> IMO a fine of 100x the value of a copy of the pirated work is more than sufficient as a punishment for piracy.

Anti-piracy groups use scare letters on pirates where they threaten to sue for tens of thousands of dollars per instance of piracy. Why should it be lower for a company?

vidarh 3 days ago

> As for NVIDIA, do they even train models?

Yes. Nemotron:

https://www.nvidia.com/en-gb/ai-data-science/foundation-mode...

godelski 3 days ago

I mean you can Google these... They have also been popping up on HN for the last year, it is even referenced in the article, and there's even another post in the sidebar titled "Anthropic Record AI Copyright Pact Sets Bar for OpenAI, Meta"[0], so I really didn't feel it was necessary to provide links. But sure, if you're feeling lazy, I got your back. I'll even limit it to HN posts so you don't even have to leave the site.

  Torrenting:
  Meta Pirating Books[1,2,3]
    - Fun fact: [1] is the most popular post of all time on HN for the search word "torrent" and the 5th-ranked for "Meta". [2] is the 16th for "illegal"
  Nvidia [4,5]
  Apple, Nvidia, Anthropic[6]
  GitHub [7,8]
  OpenAI [9,10]
  Google [11]
    - I mean this one was even mentioned in the article from the Anthropic post from a few days ago[12]
I hope that's sufficient. You can find plenty more if you do a good old-fashioned search instead of just using the HN search. But most of these were pretty high-profile stories, so it was pretty quick to look them up.

  > which establishes president that training an LLM is fair use.
                      ~~~~~~~~~
                      precedent
I think you misunderstand. The precedent is over the issue of piracy. This has not set precedent on the issue of fair use. There is ongoing litigation, but there was precedent set in another lawsuit with Meta[13], which is currently going through appeals. I'll give you a head start on that one [14,15]. But the issue of fair use is still being debated. These things take years and I don't think anyone will be surprised when this stuff lands in some of the highest courts and gets revisited in a different administration.

  > IMO a fine of 100x the value of a copy of the pirated work is more than sufficient as a punishment for piracy.
Sure. You can have whatever opinion you want. I wasn't arguing about your opinion. I even agreed with it[16]!

But that is a different topic altogether. I still think you've vastly oversimplified the conversation and are thus unintentionally making some naive assumptions. It's the whole reason I said "probably" in [16]. The big difference being just that you're smart enough to figure out how law works and I'm smart enough to know that neither of us is a lawyer.

And please don't ask me for more citations unless they are difficult to Google... I think I already set some kinda record here...

  [0] https://archive.is/3oCg8
  [1] https://news.ycombinator.com/item?id=42971446
  [2] https://news.ycombinator.com/item?id=43125840
  [3] https://news.ycombinator.com/item?id=42772771
  [4] https://news.ycombinator.com/item?id=40505480
  [5] https://news.ycombinator.com/item?id=41163032
  [6] https://news.ycombinator.com/item?id=40987971
  [7] https://news.ycombinator.com/item?id=33457063
  [8] https://news.ycombinator.com/item?id=27724042
  [9] https://news.ycombinator.com/item?id=42273817
  [10] https://news.ycombinator.com/item?id=38781941
  [11] https://news.ycombinator.com/item?id=11520633
  [12] https://news.ycombinator.com/item?id=45142885
  [13] https://perkinscoie.com/insights/update/court-sides-meta-fair-use-and-dmca-questions-leaves-door-open-future-challenges
  [14] https://arstechnica.com/tech-policy/2025/07/meta-pirated-and-seeded-porn-for-years-to-train-ai-lawsuit-says/
  [15] https://torrentfreak.com/copyright-lawsuit-accuses-meta-of-pirating-adult-films-for-ai-training/
  [16] https://news.ycombinator.com/item?id=45190232