Remix clone Hacker News

new | show | ask | jobs Github

	▲	abcde666777 3 hours ago
		Chess AI isn't trained in the same way. Things like alpha zero partly worked by playing themselves recursively, meaning they actually did generate novel data in the process. That was partly possible because chess is a constrained domain: rigid rules and board states. But LLM land is not like that. LLM land was trained on pre-existing text written by humans. They do discover patterns within said data but the point stands that the data and patterns within are not actually novel.
	▲	charcircuit 3 hours ago \| parent [-]
		>LLM land was trained on pre-existing text written by humans. Some of the pretraining. Other pretraining is on text written by AI. Human training data is only but a subset of what these models train on. There is a ton of synthetic training data now.