Remix clone Hacker News

new | show | ask | jobs Github

	▲	AlotOfReading 7 hours ago
		What training data? Many of these languages have very little digitized literature. Even if we assume they have sizeable extant corpuses (e.g. Tibetic/Bhoti), that's not enough. LLMs are still pretty garbage at English prose, for example.
	▲	general_reveal 6 hours ago \| parent [-]
		!Remind me in 1 year (certainly less than 5).