Remix clone Hacker News

new | show | ask | jobs Github

	▲	rhaps0dy 4 days ago
		The easiest thing to do is probably use A* to find the solution, then imitation learning on the NN to learn it. (The immediate feedback gets rid of the vanishing/exploding gradients problem).