| ▲ | mathisfun123 2 hours ago | |||||||
You don't know what you're talking about: an enormous amount of TOPs now runs through quantized (read: integer) kernels. Many GPUs don't have even FP64 or even FP32 support. | ||||||||
| ▲ | jmalicki an hour ago | parent [-] | |||||||
EDIT: I was completely wrong, I have mostly worked with GGUF and related quantizations that are LUTs, thank you for correcting me. | ||||||||
| ||||||||