You can already do this, but platformised generative AI is sloppy by comparison and not that interesting.
https://github.com/acids-ircam/RAVE