Remix.run Logo
stavros 14 hours ago

I wonder if we can use gpt-image-1 outputs, with some noise, as inputs to diffusion models, so GPT takes care of adherence and the diffusion model improves the quality. Does anyone know whether that's at all possible?

AuryGlenz 11 hours ago | parent | next [-]

Sure. I suppose with API support 3 hours ago someone probably made a Comfy node all of 2 hours ago. From there you can either just do a low denoise or use one of the many IP-Adapter type things out there.

levzzz 12 hours ago | parent | prev [-]

yes it's what a lot of people have been doing with newer models which have better prompt adherence, passing them through older models with better aesthetics