Bukhmanizer 19 hours ago

The point of the article is boring, but training LLMs on documents from a particular time period is actually pretty interesting.

ijk 19 hours ago

Assembling 6GB of training data is actually rather impressive, given the constraints.