Also image generation, particularly with latest GPT, can be finetuned a lot more than music generation which is nowadays limited to "here's something with those lyrics and genre".