They're working—almost done—on a CUDA backend for their Apple Silicon framework:
https://github.com/ml-explore/mlx/pull/1983