Remix.run Logo
julianlam 4 hours ago

Last time I tried Gemma 4 (26B-A4B) its memory usage would balloon and consume all of my swap until my machine died.

Qwen 3.6 on the other hand barely uses any memory at all for its KV cache.

verdverm 3 hours ago | parent [-]

Turns out when you block people from the best and biggest hardware, they get innovative. It reminds me of the Pentium days when everyone was shipping inefficient programs because the processor would be better next year.

iknowstuff 14 minutes ago | parent [-]

we never stopped doing that!