This is an in-browser llamacpp implementation: https://github.com/ngxson/wllama
And related is the whisper implementation: https://ggml.ai/whisper.cpp/