WithinReason 10 hours ago
It's on the page:
Aurornis 9 hours ago
Additional VRAM is needed for context. This is a MoE model with only 3B active parameters per token, which works well with partial CPU offload. So in practice you can run the -A(N)B models on systems that have a little less VRAM than the model nominally needs. The more you offload to the CPU, the slower it becomes, though.
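A rough sketch of the arithmetic behind that comment (not from the thread): weights take roughly parameters × bits per weight / 8 bytes, context needs VRAM on top of that, and whatever does not fit is offloaded to the CPU. The function names and every number below are hypothetical, chosen only for illustration.

```python
def weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes), ignoring metadata."""
    return n_params * bits_per_weight / 8 / 1e9


def offload_split(total_weight_gb: float, vram_gb: float, context_gb: float):
    """Return (GB kept on GPU, GB offloaded to CPU).

    Assumption: context is reserved in VRAM first, and the remaining
    VRAM is filled with weights. Real runtimes split by layer or by
    tensor, so this is only a back-of-the-envelope estimate.
    """
    gpu_budget = max(vram_gb - context_gb, 0.0)
    on_gpu = min(total_weight_gb, gpu_budget)
    on_cpu = total_weight_gb - on_gpu
    return on_gpu, on_cpu


# Hypothetical example: 20 GB of quantized weights, a 16 GB GPU,
# and 2 GB reserved for context.
on_gpu, on_cpu = offload_split(20.0, 16.0, 2.0)
```

The larger the CPU share, the slower generation gets, which matches the trade-off described above.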
est 8 hours ago
I really want to know what M, K, XL, and XS mean in this context and how to choose. I searched all the Unsloth docs and there seems to be no explanation at all.
JKCalhoun 8 hours ago
"16-bit BF16 69.4 GB" Is that (BF16) a 16-bit float?
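For reference (not from the thread): BF16 (bfloat16) is a 16-bit floating-point format with 1 sign bit, 8 exponent bits, and 7 mantissa bits, which makes it the upper half of an IEEE 754 float32. A minimal sketch with hypothetical helper names, assuming simple truncation (real converters usually round to nearest instead):

```python
import struct


def float32_to_bf16_bits(x: float) -> int:
    """Keep the upper 16 bits of the float32 encoding of x (truncation)."""
    (bits32,) = struct.unpack("<I", struct.pack("<f", x))
    return bits32 >> 16


def bf16_bits_to_float(bits16: int) -> float:
    """Expand 16 bfloat16 bits back to a float by zero-filling the low bits."""
    (x,) = struct.unpack("<f", struct.pack("<I", bits16 << 16))
    return x


bits = float32_to_bf16_bits(3.14159)
approx = bf16_bits_to_float(bits)  # same range as float32, about 2-3 decimal digits
```

At 2 bytes per weight, the quoted 69.4 GB would correspond to roughly 34.7 billion parameters, assuming GB means 10^9 bytes and ignoring file metadata.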
palmotea 9 hours ago
Thanks! I'd scanned the main content but I'd been blind to the sidebar on the far right.