▲ | ghuntley 8 days ago | ||||||||||||||||
> - weird copyright / IP questions all around. This will be a minefield. Yeah, we're in weird territory because you can drive an LLM as a Bitcoin mixer over intellectual property. That's the entire point/meaning behind https://ghuntley.com/z80. You can take something that exists, distill it back to specs, and then you've got your own IP. Throw away the tainted IP, and then just run Ralph over a loop. You are able to clone things (not 100%, but it's better than hiring humans). | |||||||||||||||||
▲ | whs 7 days ago | parent | next [-] | ||||||||||||||||
I wrote an MCP based on that technique - https://github.com/whs/mcp-chinesewall Basically to avoid the ambiguity of training LLM from unlicensed code, I use it to generate description of the code to another LLM trained from permissively licensed code. (There aren't any usable public domain models I've found) I use it in real world and it seems that the codegen model work 10-20% of the time (the description is not detailed enough - which is good for "clean room" but a base model couldn't follow that). All models can review the code, retry and write its own implementation based on the codegen result though. | |||||||||||||||||
| |||||||||||||||||
▲ | heavyset_go 8 days ago | parent | prev | next [-] | ||||||||||||||||
> then you've got your own IP. AI output isn't copyrighted in the US. | |||||||||||||||||
| |||||||||||||||||
▲ | sitkack 8 days ago | parent | prev | next [-] | ||||||||||||||||
repoMirror is the wrong name, aiCodeLaundering would be more accurate. This is bulk machine translation from one language to another, but in this case, it is code. | |||||||||||||||||
▲ | rasz 8 days ago | parent | prev [-] | ||||||||||||||||
>and then you've got your own IP. except you dont |