Remix clone Hacker News

new | show | ask | jobs Github

	▲	gopalv 2 hours ago
		> Well, there is also a big difference that it will not learn over time. My work is in tick-tock loop of learning - learn without modifying weights, demonstrate learnings to human, but then lock it back in (accumulate and spread). This looks less like training and more like mentoring. Getting a human to mentor an agent is a hard UX task, but the learning loop is not a technological problem anymore. We can only get a tick once a week, no matter how many tocks we can do an hour.