| ▲ | kouteiheika an hour ago | |
The model is natively quantized (i.e. it was trained that way in the first place, so this is not a post-training quantization which degrades performance). | ||
| ▲ | theanonymousone 27 minutes ago | parent [-] | |
But the huggingface link mentions BF16, F16, and I32? | ||