I don't understand why the open source model providers don't also publish the quantized version?
They sometimes do! Qwen, Google etc do them!