Remix.run Logo
jakub_g 3 hours ago

Since talking images, are there any AI models that can output real transparent gifs/pngs?

And not a (botched) fake white/gray grid background that is commonly used to visualize transparency?

dyates 2 hours ago | parent | next [-]

ChatGPT's image generator has been able to do this since last year. That NBP still can't is baffling. They should at least train it to respond to requests for transparency with a solid colour pink background.

vunderba 27 minutes ago | parent [-]

This. Gpt-image-1/1.5 are the only ones that have this built in - though I'd love to have an insider view if its natively considering the alpha channel or just feeding it through a rembg-style post processor.

RobinL 11 minutes ago | parent [-]

Certainly the initial versions were post processing rather than native. I'd be interested to know if that has changed on subsequent releases.

minimaxir 3 hours ago | parent | prev [-]

You can output to a plain background and use any number of tools to mask it.

jakub_g 3 hours ago | parent [-]

I know. It sounds like a perfect task for AI to do it though (wasn't the whole premise of AI do to mundane things for us), yet they fail to do it, and I need to use an external tool.

minimaxir 3 hours ago | parent [-]

Alpha is a 4th image channel that 99%+ of images in the training data do not use, so it makes more pragmatic sense to just not allow it.