Remix.run Logo
nullc 2 hours ago

From the paper it appears that it's probably more useful on small-ish models.

lwansbrough 21 minutes ago | parent [-]

What does it cost to train a model like 1-bit Bonsai? Anyone know?