kouteiheika an hour ago

The model is natively quantized (i.e. it was trained that way in the first place, so this is not post-training quantization, which degrades performance).

theanonymousone 27 minutes ago

But the Hugging Face link mentions BF16, F16, and I32?