| ▲ | The quadratic sandwich(fedemagnani.github.io) | ||||||||||||||||
| 67 points by cpp_frog 3 days ago | 4 comments | |||||||||||||||||
| ▲ | laGrenouille 2 hours ago | parent | next [-] | ||||||||||||||||
Great visualizations. Really enjoyed having a well-written example where mathematical proofs directly help with understanding a practical application. I wonder what would happen with this analysis if a momentum term was added to the gradient descent. It seems that it would fix the specific failure modes in the examples, but I wonder if there's a corresponding mathematical way of categorizing what kinds of functions can(not) be quickly optimized with GD + momentum. | |||||||||||||||||
| ▲ | xuzhenpeng 4 hours ago | parent | prev [-] | ||||||||||||||||
The animation is very good, making the article easy to understand | |||||||||||||||||
| |||||||||||||||||