Remix clone Hacker News

	ygouzerh a year ago
		So from what I understand, it actually means that the model was, for example, never trained on a video of an apple, maybe only on videos of bread, pineapple, or chocolate. However, because it was also trained on generic text data, similar to a normal LLM, it knows what an apple is supposed to look like. It's similar to a kid who has never seen a banana, but whose parent has described it to him.