nicole_express 3 hours ago
Not a lawyer, but my understanding is: in theory, copyright only protects the creative expression in source code. That is the point of the "clean room" dance: you keep only the functional behavior, which is not protected by copyright. Patents are, of course, an entirely different can of worms. So using an LLM to strip all of the "creative expression" out of source code while reproducing the same functionality feels like it could be equivalent enough. I like the article's point about legal vs. legitimate here, though; copyright is actually something of a strange animal to use for protecting source code. It was just the most convenient pre-existing framework to shove it into.
dathinab an hour ago
> this is the point of the "clean room" dance

Which is the actually relevant part: they didn't do that dance, AFAIK.

AI is a tool, and they set it up to make a non-verbatim copy of a program. Then they fed it the original software (AFAIK). That makes it a side-by-side copy, as in the original source was used as a reference to create the new program, which tends to be seen as a derived work even if the result looks very different.

IMHO they would have to:

1. Create a specification of the software _without looking at the source code_, i.e. by observing its behavior (plus an interface description). That is, you give the AI access to running the program, but not to looking at its insides. I really don't think they did this, because even with AI it's a huge pain: you normally can't just brute-force all combinations of inputs and instead need a scientific model => test => refine loop (which AI can do, but it can take a long time and get stuck, so you want it human-assisted, and that human can't have inside knowledge of the program).

2. Then generate a new program from the specification, and only from it. No git history, no original source code access, no program access, no shared AI state or anything like that.

Also, to go the extra mile on legal risk avoidance, do both steps human-assisted and use unrelated third parties without inside knowledge for each of them.

While this does majorly cut the cost of a clean room approach, it still isn't free. And it is still a legal minefield if done by a single person, especially if they are familiar enough with the original to potentially remember specific pieces of code verbatim.
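To make the two steps concrete, here is a minimal sketch of the separation in Python. Everything in it is hypothetical: `original_program` is a toy stand-in for the real binary, the candidate models stand in for the hypotheses an AI or human would propose, and the "specification" is reduced to a table of observed input/output pairs. A real effort would need far richer specs and, of course, actual legal review.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical stand-in for the original program. Stage 1 may only *run* it,
# never read its source.
def original_program(x: int) -> int:
    return x * 2 if x >= 0 else 0

# The only artifact allowed to cross from stage 1 to stage 2:
# observed input -> output pairs.
Spec = Dict[int, int]

def observe(run: Callable[[int], int], inputs: List[int]) -> Spec:
    """Black-box observation: record outputs for the chosen probe inputs."""
    return {x: run(x) for x in inputs}

def first_mismatch(model: Callable[[int], int], spec: Spec) -> Optional[int]:
    """Return the smallest observed input the model gets wrong, or None."""
    for x, expected in sorted(spec.items()):
        if model(x) != expected:
            return x
    return None

def refine_loop(
    run: Callable[[int], int],
    candidates: List[Tuple[str, Callable[[int], int]]],
    probe_inputs: List[int],
) -> Tuple[Optional[str], Spec]:
    """Stage 1: model => test => refine, using behavior only."""
    spec = observe(run, probe_inputs)
    for name, model in candidates:
        if first_mismatch(model, spec) is None:
            return name, spec  # first hypothesis consistent with all observations
    return None, spec

# Stage 1: hypotheses are tried in order until one explains every observation.
candidates = [
    ("identity", lambda x: x),
    ("double", lambda x: 2 * x),
    ("double_clamped", lambda x: 2 * x if x >= 0 else 0),
]
accepted, spec = refine_loop(original_program, candidates, [-3, -1, 0, 1, 4])

# Stage 2: written by a separate party that sees `spec` and nothing else.
def reimplementation(x: int) -> int:
    return max(0, 2 * x)

conforms = first_mismatch(reimplementation, spec) is None
```

Note that stage 2 never calls `original_program`; it is checked against `spec` alone, which is the whole point of the separation. The sketch also shows why step 1 is the expensive part: the spec is only as good as the probe inputs someone thought to try.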