| ▲ | scaredginger an hour ago | |||||||||||||
Bit of a nitpick, but I think his terminology is wrong. Like RL, pretraining is also a form of *un*supervised learning | ||||||||||||||
| ▲ | cubefox an hour ago | parent [-] | |||||||||||||
Usual terminology for the three main learning paradigms: - Supervised learning (e.g. matching labels to pictures) - unsupervised learning / self-supervised learning (pretraining) - reinforcement learning Now the confusing thing is that Dwarkesh Patel instead calls pretraining "supervised learning" and you call reinforcement learning a form of unsupervised learning. | ||||||||||||||
| ||||||||||||||