| ▲ | ACCount37 3 hours ago | |||||||
Good to have more open weight models, and I really appreciate the in-depth write-up. I also like the "keep the manifold wide" approach of trying to make a model capable of many styles as opposed to getting it "dialed in" for a dozen of style presets. But it does feel very much like "fighting the past war" - now that advanced "image-to-image"/"agentic composition" models like Nano Banana 2 or Images 2.0 are out there in force. I seriously doubt that the basic Qwen 3 VL in cross can get anywhere near that level of I2I. And robust I2I is very desirable - editing, adjustment, character consistency, the generalization of whatever you're doing with style transfer now (underexplained BTW). Trying to hit that level of I2I is not by any means easy, but it's pretty clear to me that this is where the next frontier for image models lies. Feels like Ideogram might be building up to it, but I'm yet to see it anywhere else in open weight space. | ||||||||
| ▲ | dvrp 2 hours ago | parent | next [-] | |||||||
I appreciate the skepticism but we find internally that this model is used more than Nano Banana for many cases like moodboarding (also, 4x cheaper than NBP never hurts). Agentic workflows are compatible with Krea 2 so I’m not sure I follow there. If you are talking about an edit model, that’s coming too. Also, we are on par with them in t2i benchmarks, check the artificial analysis link I posted in my top comment. And you cannot re-train nano banana or ChatGPT to understand your brand, which is what our customers complain about constantly. Plus open-source! It’s hard to do an apple to apple comparison. | ||||||||
| ||||||||
| ▲ | refulgentis 2 hours ago | parent | prev [-] | |||||||
This model does image to image; whats the issue with Qwen 3 VL; is style transfer unexplained? " reference" is mentioned 11 times on the page (more specifically, I read it and it seemed to discuss it a lot) | ||||||||