scrlk 6 hours ago
IME, unquantised -> FP8 is pretty much lossless. What matters more is keeping the KV cache unquantised: an FP8 KV cache can cause a significant drop in quality.
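For a rough feel of how much rounding error FP8 adds per value, here is a minimal sketch in plain Python. It assumes the FP8 format is E4M3 (3 mantissa bits, max magnitude 448) with saturating round-to-nearest, which the comment above does not specify. `fake_quant_e4m3` and `mean_relative_error` are hypothetical helper names, not any library's API. It only measures per-value rounding error and says nothing about end-to-end model or KV cache quality.

```python
import math
import random

E4M3_MAX = 448.0      # largest finite magnitude in the assumed E4M3 format
MIN_NORMAL_EXP = -6   # exponent of the smallest normal value
MANTISSA_BITS = 3     # 3 mantissa bits -> 8 steps per power-of-two interval


def fake_quant_e4m3(x: float) -> float:
    """Round x to the nearest value on an E4M3-like grid, saturating at +/-448."""
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), E4M3_MAX)
    # frexp gives mag = m * 2**e with 0.5 <= m < 1, so the binade exponent is e - 1.
    # Values below the smallest normal share the minimum exponent (subnormal spacing).
    exp = max(math.frexp(mag)[1] - 1, MIN_NORMAL_EXP)
    step = 2.0 ** (exp - MANTISSA_BITS)
    return sign * round(mag / step) * step


def mean_relative_error(values) -> float:
    """Average |q(v) - v| / |v| over the non-zero inputs."""
    errs = [abs(fake_quant_e4m3(v) - v) / abs(v) for v in values if v != 0.0]
    return sum(errs) / len(errs)


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-in for a weight tensor; real tensors are usually scaled first.
    weights = [rng.gauss(0.0, 1.0) for _ in range(10_000)]
    print(f"mean relative error: {mean_relative_error(weights):.4f}")
```

Within the normal range the relative rounding error of this grid is bounded by 2^-4 (about 6%) per value. Whether that is "pretty much lossless" for a given model is an empirical question this sketch does not answer.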
johnnyApplePRNG 3 hours ago
> unquantised -> FP8 is pretty much lossless
Claude Shannon is rolling in his grave.
ComputerGuru 4 hours ago
Do infra providers reveal that level of implementation detail?