▲ | cedws 7 days ago | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
I don’t think OpenAI train on data processed via the API, unless there’s an exception specifically for this. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | dpoloncsak 7 days ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Maybe I misunderstand, but I'm pretty sure they offer an option for cheaper API costs (or maybe its credits?) if you allow them to train on your API requests. To your point, pretty sure it's off by default, though Edit: From https://platform.openai.com/settings/organization/data-contr... Share inputs and outputs with OpenAI "Turn on sharing with OpenAI for inputs and outputs from your organization to help us develop and improve our services, including for improving and training our models. Only traffic sent after turning this setting on will be shared. You can change your settings at any time to disable sharing inputs and outputs." And I am 'enrolled for complimentary daily tokens.' | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | trhway 7 days ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
i'd not rule out some approach like instead of training directly on the data, may be they would train on a very high dimensional embedding of such a data (or some other similarly "anonymized", yet still very semantically rich representation of the data) | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | dannyw 7 days ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Can you truly trust them though? | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | johnthescott 7 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||
i am too lazy to ask openai. |