semessier 7 days ago

Still waiting for vLLM to support Metal GPUs on ARM Macs.

baggiponte 7 days ago | parent [-]

Yeah. The docs tell you that you should build it yourself, but…

tough 6 days ago | parent [-]

but unlike CUDA, there are no custom kernels for inference in the vLLM repo...

I think