▲ | whatever1 4 days ago | |
Thanks! It does feels to me that we do some sort of sampling, definitely is not a naive grid search. Also I find it easier to find the minima in specific directions (up, down, left, right) rather than let’s say a 42 degree one. So some sort of priors are probably used to improve sample efficiency. |