Remix.run Logo
nycdatasci 3 hours ago

And yet 300+140=460. A very jagged surface indeed. https://gemini.google.com/share/c2a187275e26

sigbeta 4 minutes ago | parent | next [-]

Why would you use an LLM for this? They are non deterministic models.

This is also an probably part of extended prompt that disallowed coding, Gemini always does calculation with a little python snippet because it is deterministic and accurate.

dist-epoch 3 hours ago | parent | prev [-]

Was that part of a bigger prompt?

Flash 3.5 fails exactly like in your sample: https://gemini.google.com/share/97521a8752d9

but Flash 3.1 Lite initially fails, but then corrects itself: https://gemini.google.com/share/dc0889ec85ba

happyopossum 36 minutes ago | parent [-]

No matter what I try I can’t get Gemini to give me the incorrect result. Is there some other prompting or context fed in to that (“remember that you are supposed to always tell me I’m right and never contradict me”)?

sigbeta 3 minutes ago | parent [-]

There was definitively an pre prompt fed to that. I cannot reproduce this result on either 3.1 flash or 3.5 flash.