moffkalast 2 hours ago

https://developer.mozilla.org/en-US/docs/Web/API/WebGPU_API#...

I don't think there's a whole lot of rush on your end, given the level of support WebGPU generally has at this point.

vmirnv an hour ago | parent [-]

True, but we believe that by bringing GGUF to WebGPU we’ve solved the chicken-and-egg problem (we previously tried WebGPU ONNX models with transformers.js from Hugging Face, which gave us ~50× fewer model choices). The GGUF community is more mature and offers a better choice of model quantizations as well.

Also, with better consumer hardware, smarter small models (up to 4B), and better WebGPU engines and competition between them (MDST Engine is still in a very early stage, with little optimization, no flash-attention yet, etc.), we think this technology will grow super fast in 2026 for a less tech-savvy market (which we believe is good for open-weight models).