siva7 6 hours ago

Interesting. When I asked Gemini 3 Pro to generate an infographic from my personal accounting sheet, it first failed to generate anything except a black background. Then it generated something that mixed different languages in a nonsensical way, with obvious typos and irrelevant information grouping. It's certainly a leap forward in OCR, rendering classic OCR useless.

minimaxir 6 hours ago | parent [-]

That's more of an issue with Nano Banana Pro than with Gemini 3 Pro.

siva7 5 hours ago | parent [-]

What's the difference? I thought the vision AI component of Gemini 3 is called Nano Banana?

IanCal 5 hours ago | parent | next [-]

That’s about generating images; the other side is about understanding images.

brokensegue 5 hours ago | parent | prev [-]

I assumed Nano Banana was just a tool that Gemini 3 used, though I don't know.

minimaxir 5 hours ago | parent [-]

Gemini 3 Pro's text encoder powers Nano Banana Pro, but Nano Banana Pro has its own image decoding model that turns the generated image tokens into an actual image, and that decoder appears to be the more pertinent issue in this case.