Remix.run Logo
naasking 3 days ago

Interesting idea, but I hope people just start switching to ParoQuant and eliminate basically all quantization errors relative to fp16/bf16 even going down to 4-bits:

https://github.com/z-lab/paroquant