| ▲ | throw310822 7 hours ago | |||||||
> The training data If the prompt is unique, it is not in the training data. True for basically every prompt. So how is this probability calculated? | ||||||||
| ▲ | cbovis 7 hours ago | parent | next [-] | |||||||
The prompt is unique but the tokens aren't. Type "owejdpowejdojweodmwepiodnoiwendoinw welidn owindoiwendo nwoeidnweoind oiwnedoin" into ChatGPT and the response is "The text you sent appears to be random or corrupted and doesn’t form a clear question." because the prompt doesnt correlate to training data. | ||||||||
| ||||||||
| ▲ | qsera 7 hours ago | parent | prev | next [-] | |||||||
Just using a scaled up and cleverly tweaked version of linear regression analysis... | ||||||||
| ||||||||
| ▲ | hmmmmmmmmmmmmmm 6 hours ago | parent | prev [-] | |||||||
Hamiltonian paths and previous work by Donald Knuth is more than likely in the training data. | ||||||||
| ||||||||