> the GPU is limited by the Thunderbolt port
Not everything is limited by the transfer speed to/from the GPU. LLM inference, for example.