muyuu 2 hours ago

i have a Strix Halo machine

typically those dense models are too slow on Strix Halo to be practical, expect 5-7 tps

you can get an idea by looking at other dense benchmarks here: https://strixhalo.zurkowski.net/experiments - i'd expect this model to be tested here soon, i don't think i will personally bother

hedgehog 2 hours ago

This one is around 250 t/s prefill and 12.4 t/s generation in my testing.
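
To get a feel for what those numbers mean in wall-clock terms, here is a rough back-of-the-envelope sketch. The 4000-token prompt and 500-token reply are made-up sizes for illustration, and the function name is hypothetical. It also assumes throughput stays constant, which it generally won't as context grows.

```python
def estimate_seconds(prompt_tokens, output_tokens, prefill_tps=250.0, gen_tps=12.4):
    """Rough latency estimate from throughput figures (tokens per second).

    Defaults are the numbers quoted above; token counts are assumptions.
    """
    prefill = prompt_tokens / prefill_tps    # time to process the prompt
    generation = output_tokens / gen_tps     # time to produce the reply
    return prefill, generation, prefill + generation


# Hypothetical request: 4000-token prompt, 500-token reply
prefill_s, gen_s, total_s = estimate_seconds(prompt_tokens=4000, output_tokens=500)
print(f"prefill {prefill_s:.1f}s, generation {gen_s:.1f}s, total {total_s:.1f}s")
# prints "prefill 16.0s, generation 40.3s, total 56.3s"
```

So under these assumptions most of the wait is generation, not prefill.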