Am I reading this wrong, or does this only support FP16 inputs and compare its performance against an FP32 solver?