ghuntley 8 days ago

> - weird copyright / IP questions all around. This will be a minefield.

Yeah, we're in weird territory because you can use an LLM as a Bitcoin mixer for intellectual property. That's the entire point behind https://ghuntley.com/z80.

You can take something that exists, distill it back to specs, and then you've got your own IP. Throw away the tainted IP, and then just run Ralph in a loop. You can clone things (not 100%, but it's better than hiring humans).

whs 7 days ago

I wrote an MCP based on that technique - https://github.com/whs/mcp-chinesewall

Basically, to avoid the ambiguity of training an LLM on unlicensed code, I use it to generate a description of the code for another LLM trained on permissively licensed code. (I haven't found any usable public-domain models.)

I use it in the real world, and the codegen model seems to work 10-20% of the time (the description is not detailed enough, which is good for "clean room" purposes, but a base model can't follow it). All models can review the code, retry, and write their own implementation based on the codegen result, though.
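For readers who want the shape of that flow, here is a minimal sketch, not the actual mcp-chinesewall code. All names (`clean_room_generate`, `describe`, `codegen`, `review`, `max_attempts`) are hypothetical, and the three models are passed in as plain callables so nothing here assumes a particular LLM API. The only point it illustrates is the separation: the describing model sees the source, and the codegen model sees only the description.

```python
from typing import Callable, Optional


def clean_room_generate(
    source: str,
    describe: Callable[[str], str],      # hypothetical: model allowed to read the original code
    codegen: Callable[[str], str],       # hypothetical: permissively trained model, sees only the description
    review: Callable[[str, str], bool],  # hypothetical: checks a candidate against the description
    max_attempts: int = 3,
) -> Optional[str]:
    """Return code written from a description alone, or None if no attempt passes review."""
    # Only the describing step ever touches the original source.
    description = describe(source)
    for _ in range(max_attempts):
        # The codegen step receives the description, never the source.
        candidate = codegen(description)
        if review(description, candidate):
            return candidate
    # Every attempt failed review; the caller decides what to do next.
    return None
```

With a codegen success rate as low as the one described above, the retry loop does most of the work, so `max_attempts` would likely need tuning in practice.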

ghuntley 7 days ago

Nice. Any chance you could put in some attributions and credits in your paper? https://orcid.org/0009-0007-3955-9994

whs 7 days ago

I never read your work, though (and still haven't, since it's paywalled). I only found out today that we independently discovered the same thing.

heavyset_go 8 days ago

> then you've got your own IP.

AI output isn't copyrighted in the US.

miohtama 7 days ago

He is referring to taking AI output and making it your company's property.

AlexandrB 7 days ago

If AI output can't be copyrighted, it can't be your company's property, just the company's secret. And you can't sue anyone who uses the secret if it gets out.

sitkack 8 days ago

repoMirror is the wrong name; aiCodeLaundering would be more accurate. This is bulk machine translation from one language to another, except that here the language is code.

rasz 8 days ago

> and then you've got your own IP.

except you don't