cl3misch 5 days ago
The previous comment highlights an example where backprop is confused with "a supervised learning algorithm". My comment was about "confusing backpropagation with gradient descent (or any optimizer)." To me the connection is pretty clear: the core issue in both cases is confusing backprop, which only computes gradients, with the minimization that uses them. That the cited article mentions supervised learning specifically doesn't take away from that.