Yeah, look at their open-source models and how they fit such high parameter counts into so little VRAM.
It's impressive, but a regression for now in direct comparison to just a high-parameter model.