| ▲ | FeepingCreature 11 days ago | |||||||
It's good, but like many explainers it discounts the repeated nonlinear layers. Just multiplying numbers (linear operations) could not make a system you could talk to. | ||||||||
| ▲ | mr_toad 11 days ago | parent [-] | |||||||
The layers don’t have to be non-linear, but you need a non-linear activation function between them. People often overlook the importance of the network topology and the activation functions. The weights alone are not a complete description of the network. | ||||||||
| ||||||||