Remix.run Logo
cubefox 3 hours ago

Yeah, "1.58 bit" is 1 trit with three states, since log2(3)≈1.58.

So it's not a inference framework for 1-bit models (two states per parameter) but for 1.58 bit models (three states per parameter). Annoying that they try to mix up the two.

silon42 2 hours ago | parent [-]

I always hope for "just a bunch of if statements" ... this is not it.

himata4113 2 hours ago | parent [-]

it's if {} else if {} else {}