hinkley 17 hours ago

Do you think there’s a call for introducing an even smaller float that can pack more values into a SIMD register? Like a 12-bit float?

boulos 14 hours ago

The latest GPUs and TPUs support fp8. It's a big part of the efficiency gain in the latest systems. Blackwell also supports fp4.
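
To put rough numbers on the packing question, here is a minimal sketch of how many values of each width fit in one register. The 512-bit register width is an assumption (the thread names no specific size), fp12 is the hypothetical format from the question, and `lanes` and `wasted_bits` are illustrative helper names, not part of any library.

```python
# Assumed register width for illustration; real hardware varies.
REGISTER_BITS = 512


def lanes(register_bits: int, element_bits: int) -> int:
    """Number of whole elements of the given width that fit in the register."""
    return register_bits // element_bits


def wasted_bits(register_bits: int, element_bits: int) -> int:
    """Bits left over after packing as many whole elements as possible."""
    return register_bits % element_bits


# fp12 is hypothetical; the other widths are the ones named in the thread
# plus the common fp32/fp16 baselines.
for name, bits in [("fp32", 32), ("fp16", 16), ("fp12", 12), ("fp8", 8), ("fp4", 4)]:
    print(f"{name}: {lanes(REGISTER_BITS, bits)} lanes, "
          f"{wasted_bits(REGISTER_BITS, bits)} bits unused")
```

Under these assumptions, a 12-bit element does not divide the register evenly and is 1.5 bytes wide, so elements would straddle byte boundaries, while 8-bit and 4-bit elements pack cleanly.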