Remix clone Hacker News

new | show | ask | jobs Github

	▲	madduci 21 hours ago
		The first users of this dataset will be Big Tech corps. Meta, Alphabet, OpenAI, Microsoft, Apple will all be happy to use this dataset for training their LLMs. For them, 300TB is just cheap
	▲	ipsum2 19 hours ago \| parent [-]
		They already have this data. See jukebox from OpenAI, released before chatgpt.