| ▲ | bee_rider 5 hours ago | |
Does the law really not distinguish between mechanical processing of data, and humans learning from it? It seems surprising to be if every person who read a textbook is copyright infringing. It also seems surprising if something like a lossy compression algorithm is enough to protect you from copyright law. Somewhere between the two a line must be drawn… where we’d want to put that line, I guess, if up for quibbling. But it doesn’t seem obvious to me. | ||
| ▲ | gruez 4 hours ago | parent | next [-] | |
>Does the law really not distinguish between mechanical processing of data, and humans learning from it? It seems surprising to be if every person who read a textbook is copyright infringing. It also seems surprising if something like a lossy compression algorithm is enough to protect you from copyright law. The google books and google thumbnails cases have so far upheld that even mechanical reproductions are allowed, depending on the context/usage. | ||
| ▲ | mapontosevenths 4 hours ago | parent | prev [-] | |
To me the distinction hinges on the output being transformative enough to be considered a new work. I think that most of the time LLM output is. Sometimes they go a bit wonky and overtrain on specific phrases which can result in verbatim copies of brief sections of coontent. Thats a bug, not a feature. | ||