mythz 3 days ago

Also less power efficient, takes up more PCI slots, and a lot of software doesn't support GPU clustering. I already have 4x 16GB GPUs, which are unable to run large models exceeding 16GB.

Currently running them in different VMs to make full use of them. I used to run them in different Docker containers, but OOM exceptions would frequently bring down the whole server, which moving to VMs helped resolve.

zargon 3 days ago | parent [-]

What’s your high-VRAM application that doesn’t leverage multiple GPUs?