This should be what you're looking for. It doesn't use the GPU yet, but WebGL support is on the project's TODO list.
https://github.com/ngxson/wllama
https://huggingface.co/spaces/ngxson/wllama