The price, processed tokens, and output can be anything, it just depends on what GPU it is.
Nvidia GPUs are much more efficient than Apple hardware for inference(and training).