Remix.run Logo
thegeomaster 18 hours ago

Well, there's also gemini-2.0-flash-exp-image-generation. Also autoregressive/transfusion based.

thefourthchime 18 hours ago | parent | next [-]

Such a good name....

Yiling-J 14 hours ago | parent | prev | next [-]

gemini-2.0-flash-exp-image-generation doesn’t perform as well as GPT-4o's image generation, as mentioned in section 5.1 of this paper: https://arxiv.org/pdf/2504.02782. However based on my test, for certain types of images such as realistic recipe images, the results are quite good. You can see some examples here: https://github.com/Yiling-J/tablepilot/tree/main/examples/10...

raincole 11 hours ago | parent | prev | next [-]

It's quite bad now, but I have no doubt that Google will catch up.

The AI field looks awfully like {OpenAI, Google, The Irrelevent}.

yousif_123123 17 hours ago | parent | prev | next [-]

It's also good but clearly not close still. Maybe Gemini 2.5 or 3 will have better image gen.

swyx 13 hours ago | parent | prev [-]

> transfusion based.

what is that?