| ▲ | muyuu 2 hours ago | |
i have a Strix Halo machine typically those dense models are too slow on Strix Halo to be practical, expect 5-7 tps you can get an idea by looking at other dense benchmarks here: https://strixhalo.zurkowski.net/experiments - i'd expect this model to be tested here soon, i don't think i will personally bother | ||
| ▲ | hedgehog 2 hours ago | parent [-] | |
This one is around 250 t/s prefill and 12.4 generation in my testing. | ||