Remix.run Logo
dormento a day ago

For all we know, AI tech companies could theoretically have converted all of the "acquired" (ahem!) training set material into base64 and used it for training as well, just like you would encode say japanese romaji or hebrew written in the english alphabet.

dtj1123 a day ago | parent | next [-]

Unlikely that every company would have bothered to do this.

idiotsecant a day ago | parent | prev [-]

'Yes, I know we already trained on all that data, but now I want you to convert to base64 and train it again! at enormous cost!'

adcoleman6 5 hours ago | parent [-]

On the contrary, it could be a deliberate attempt to augment or diversify the dataset.