Remix clone Hacker News

new | show | ask | jobs Github

	▲	XCSme 2 hours ago
		Now I need to write more tests. It's a bit hard to trick reasoning models, because they explore a lot of the angles of a problem, and they might accidentally have an "a-ha" moment that leads them on the right path. It's a bit like doing random sampling and stumbling upon the right result after doing gradient descent from those points.