▲ | rao-v 5 days ago | ||||||||||||||||
Nice! Cheap RK3588 boards come with 15GB of LPDDR5 RAM these days and have significantly better performance than the Pi 5 (and often are cheaper). I get 8.2 tokens per second on a random orange pi board with Qwen3-Coder-30B-A3B at Q3_K_XL (~12.9GB). I need to try two of them in parallel ... should be significantly faster than this even at Q6. | |||||||||||||||||
▲ | jerrysievert 5 days ago | parent | next [-] | ||||||||||||||||
> a random orange pi board with Qwen3-Coder-30B-A3B at Q3_K_XL (~12.9GB) fantastic! what are you using to run it, llama.cpp? I have a few extra opi5's sitting around that would love some extra usage | |||||||||||||||||
| |||||||||||||||||
▲ | ThatPlayer 4 days ago | parent | prev [-] | ||||||||||||||||
Is that using the NPU on that board? I know it's possible to use those too. | |||||||||||||||||
|