| ▲ | xg15 9 hours ago | |||||||
Yeah, seems it's more "exploring the distribution" as we don't actually know everything that the AIs are effectively modeling. | ||||||||
| ▲ | lawlessone 8 hours ago | parent [-] | |||||||
Am i understanding correctly that in distribution means the text predictor is more likely to predict bad instructions if you already get it to say the words related to the bad instructions? | ||||||||
| ||||||||