Remix.run Logo
pishpash 3 hours ago

Gemini (at least public free version) hallucinates way too much. If it's like that, it can go very badly for Apple.

ComputerGuru 3 hours ago | parent | next [-]

I used Gemini exclusively via the API but downloaded the app last week for something. Even on max settings, it is ridiculously nerfed!

hypfer 2 hours ago | parent [-]

Unfortunately, even the API variant got RLHF'd pretty hard into being that dumb end-user assistant personality :(

But beside that, I feel like the app variant got worse the day they've had that wwdc-style release thing recently.

Previously it was a sparring partner that could actually keep up. But now it just doesn't.

Truly a shame. And nothing that could be fixed by local models any time soon, given that you need the size for the (cross-)domain knowledge.

t0mas88 3 hours ago | parent | prev [-]

The public version of Gemini is ridiculous. At least half their search "answers" are just wrong. If you then start a follow up chat the answers change but usually still half wrong.

Search would be better without the added AI hallucinations above it. If I want an AI answer I'll go and ask Claude, the quality difference is huge.

tonfa 3 hours ago | parent | next [-]

> The public version of Gemini is ridiculous. At least half their search "answers" are just wrong.

That's not Gemini, that's AI Mode (in Search), they're different products built by fairly different part of Google (actually one is built by Deepmind).

(I don't think it's much comparable to https://gemini.google.com/app at least in the past you'd get very different results)

trollbridge an hour ago | parent [-]

And it's extremely poor marketing by Google to do this - the general perception people have is that Google AI is dumb due to this.

dyauspitr 3 hours ago | parent | prev [-]

It has to be really because think of how fast it has to come up with an answer (ie time for a regular google query) and the immense scale of billions of people querying it many times a day, all for free.

pishpash 38 minutes ago | parent [-]

Just like search itself, caching does wonders. What do 90% of the people ask anyway but mundane, totally predictable questions?