It's a terrible analogy because LLMs are already for audio what LLMs are for text. You can use LLMs to create new songs and sounds. Encyclopedias don't create new songs.