This model does image-to-image; what's the issue with Qwen 3 VL? Is style transfer unexplained? "reference" is mentioned 11 times on the page (more specifically, I read the page and it seemed to discuss it a lot).