Remix clone Hacker News

new | show | ask | jobs Github

	▲	CuriouslyC 2 years ago
		The output of a model can be copyright violation. In fact, even if the model was never trained on copyright content, if I provided copyright text then told the model to regurgitate it verbatim that would be a violation. That does not make the model copyright violation itself.
	▲	throw646577 2 years ago \| parent [-]
		This is is sort of like the argument against a blank tape levy or a tape copier tax, which is a reasonable argument in the context of the hardware. But an LLM doesn't just enable direct duplication, it (well its model) contains it. If software had a meaningful distribution cost or per-unit sale cost, a blank tape tax would be very appropriate for LLM sales. But instead OpenAI is operating a for-pay duplication service where authors don't get a share of the proceeds -- it is doing the very thing that copyright laws were designed to dissuade by giving authors a time-limited right to control the profits from reproducing copies of their work.