Remix clone Hacker News

new | show | ask | jobs Github

	▲	mschuster91 2 hours ago
		The core problem is that AI training processes can't by itself know during training that a part of the training dataset is bad. Basically, a normal human with some basic media literacy knows that tabloids, the "yellow press" rags, Infowars or Grokipedia aren't good authoritative sources and automatically downranks their content or refuses to read it entirely. An AI training program however? It can't skip over B.S., it relies on the humans compiling the dataset - otherwise it will just ingest it and treat it as 1:1 ranked with authoritative, legitimate sources.