Remix.run Logo
ai-christianson 6 days ago

This was trained on 6T tokens. Neat to see so many tokens used for such a small model.