Er, CPU inferencing? :) I didn't think I mentioned that!
The thing about the Framework Desktop is that it has memory unified with the GPU, so, much like an M-series Mac, you can run inference on disproportionately large models.
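
To put rough numbers on "disproportionately large", here's a back-of-envelope sketch. It only counts the weights (no KV cache, no runtime overhead), and the function names and the memory sizes in the usage note are made up for illustration, not specs of any particular machine.

```python
def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float, gpu_visible_gib: float) -> bool:
    """True if the weights alone fit in the memory the GPU can address.

    gpu_visible_gib is a hypothetical budget: dedicated VRAM on a discrete
    card, or whatever share of unified memory is given to the GPU.
    """
    return weights_gib(params_billion, bits_per_weight) <= gpu_visible_gib
```

So a 70B model at 4 bits per weight is about 32.6 GiB of weights: `fits(70, 4, 24)` is false for a hypothetical 24 GiB discrete card, while `fits(70, 4, 96)` is true for a hypothetical 96 GiB slice of unified memory.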