Ignoring what this model architecture could do and just considering what this model does do, why would I (or anyone) want to run this model (locally) to do <insert use-case>? Is it entirely a proof-of-concept for future training on medical data? Are they looking to use this to attempt to ethically justify training on (free-tier) user's personal data via the application of noise to the training data?

▲

malfist 3 days ago | parent | next [-]

You can hide that you pirated content for training

	▲	astrange 2 days ago \| parent [-]
		You can't hide that. You can't use technical measures to hide from discovery. I think an entire book is a little too large to mask with this method and still end up learning anything.

▲

faangguyindia 3 days ago | parent | prev | next [-]

U can avoid book publisher lawsuit which Anthropic is dealing with using this approach

▲

porridgeraisin 4 days ago | parent | prev | next [-]

It's the last option.

The whole framing of DP is:

Probability that you reveal private info is same whether or not you train on a particular users data.

It is useful in many cases, but google the product company specifically is going to use it for ads.

▲

floridianfisher 4 days ago | parent | prev [-]

The purpose is research