Remix.run Logo
jboss10 4 hours ago

They can be ran on 32GB with 8GB VRAM. I don't think these will be on 16GB for a while. (35B MoE)

TheCycoONE 4 hours ago | parent [-]

I have 32GB of RAM with 16GB VRAM and I haven't had a lot of luck running larger models like this. Are you able to expand on that?

slim 4 hours ago | parent [-]

use llama.cpp with cuda

TheCycoONE 2 hours ago | parent [-]

The problem may be that it's a 7800XT which handles memory contention by freezing.