| ▲ | cubefox 7 hours ago | |||||||||||||||||||||||||||||||||||||||||||
In your example, z-image and Nano Banana Pro look basically equally photorealistic to me. Perhaps the NBP image looks a bit more real because it resembles an unstaged smartphone shot with wide angle. Anyway, the difference is very small. I agree the lighting in Flux.2 Pro looks a bit off. But anyway, realistic environments like a street cafe are not suited to test for photorealism. You have to use somewhat more fantastical environments. I don't have access to z-image, but here are two examples with Nano Banana Pro: "A person in the streets of Atlantis, portrait shot." https://i.ibb.co/DgMXzbxk/Gemini-Generated-Image-7agf9b7agf9... "A person in the streets of Atlantis, portrait shot (photorealistic)" https://i.ibb.co/nN7cTzLk/Gemini-Generated-Image-l1fm5al1fm5... These are terribly unrealistic. Far more so than the Flux.2 Pro image above. > Also Imagen 4 and Nano Banana Pro are very different models. No, Imagen 4 is a pure diffusion model. Nano Banana Pro is a Gemini scaffold which uses Imagen to generate an initial image, then Gemini 3 Pro writes prompts to edit the image for much better prompt alignment. The prompts above a very simple, so there is little for Gemini to alter, so they look basically identical to plain Imagen 4. Both pictures (especially the first) have the signature AI look of Imagen 4, which is different from other models like Imagen 3. By the way, here is GPT Image 1.5 with the same prompts: "A person in the streets of Atlantis, portrait shot." https://i.ibb.co/Df8nDHFL/Chat-GPT-Image-10-Feb-2026-14-17-1... "A person in the streets of Atlantis, portrait shot (photorealistic)" https://i.ibb.co/Nns4pdGX/Chat-GPT-Image-10-Feb-2026-14-17-2... The first is very fake and the second is a strong improvement, though still far from the excellent cafe shots above (fake studio lighting, unrealistic colors etc). | ||||||||||||||||||||||||||||||||||||||||||||
| ▲ | GaggiX 6 hours ago | parent [-] | |||||||||||||||||||||||||||||||||||||||||||
>In your example, z-image and Nano Banana Pro look basically equally photorealistic to me I disagree, nano banana pro result is on a completely different league compare to flux.2 and z-image. >But anyway, realistic environments like a street cafe are not suited to test for photorealism Why? It's the perfect settings in my opinion. Btw I don't think you are using nano banana pro, probably standard nano banana, I'm getting this from your prompt: https://i.ibb.co/wZHx0jS9/unnamed-1.jpg >Nano Banana Pro is a Gemini scaffold which uses Imagen to generate an initial image, then Gemini 3 Pro writes prompts to edit the image for much better prompt alignment. First of all how should you know the architecture details of gemini-3-pro-image, second of all how the model can modify the image if gemini itself is just rewriting the prompt (like old chatgpt+dalle), imagen 4 is just a text-to-image model, not an editing one, it doesn't make sense, nano banana pro can edit images (like the ones you can provide). | ||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||