| ▲ | anonymous908213 9 hours ago |
| This is already possible for Harry Potter specifically. There was a study demonstrating that Sonnet 3.7, among other models tested, could reproduce the first Harry Potter book 95.8% verbatim[1]. [1] https://arxiv.org/abs/2601.02671 |
|
| ▲ | dom96 9 hours ago | parent | prev | next [-] |
| Thanks for linking! I've been thinking about trying something like this myself. |
|
| ▲ | Legend2440 9 hours ago | parent | prev [-] |
| ...only if you deliberately attempt to extract it by repeatedly prompting it to complete fragments of the book. They had to do quite a bit of work to make this happen. |
| |
| ▲ | dom96 8 hours ago | parent [-] |
| So? It demonstrates that LLMs retain the copyrighted material in their weights. This is an important thing to consider about LLMs and shows that there need to be better protections for the creative industry. |
| ▲ | fc417fc802 8 hours ago | parent | next [-] |
| Really? I retain plenty of copyrighted material in my head. What matters are the contexts in which I reproduce it (if any). A search index might also contain copyrighted material. As long as it's used for search queries as opposed to regurgitation, there's no problem. Search indexes and LLMs are both clearly very beneficial tools to have access to. |
| ▲ | themafia 7 hours ago | parent | next [-] |
| Reproduce it. Sit in a clean room and write it all out. Then go check your accuracy. I'm curious to see what it is. |
| ▲ | fc417fc802 7 hours ago | parent [-] |
| What does this (thought) experiment accomplish? That is, what point are you trying to make here? Since we're talking about an electronic system, the search index example is the more directly relevant one. Anyone who wants to object to LLMs is going to need to take care to ensure consistency with his views on Google's search index. |
| ▲ | themafia 7 hours ago | parent [-] |
| I wasn't aware I could read 95% of Harry Potter through constructed queries using Google's search index. Can you demonstrate how I might do this? Also, can you point out how copyright law changes because we're using an "electronic system" as opposed to an "analog system"? |
|
| |
| ▲ | _DeadFred_ 7 hours ago | parent | prev [-] |
| Are you a for-profit product? |
| |
| ▲ | PeterStuer 4 hours ago | parent | prev [-] |
| "there need to be better protections for the creative industry" Why exactly? |
|
|