Remix.run Logo
phkahler 34 minutes ago

So from CDNA3 to 4 they doubled fp16 and fp8 performance but cut fp32 and fp64 by half?

Wonder why the regression on non-AI workloads?

bigdict 18 minutes ago | parent [-]

cuz area and power