Its a long article and one of the first points "google strikes back." Is completely wrong ime. Not only is Gemini much worse than all the other models. The latest release is now so bad it is almost useless half the time or more. Hard to read more with such a bad take what I've seen myself. I don't care what benchmarks it beats if it just churns out comically bad results to me.

▲

Crash0v3rid3 11 hours ago | parent [-]

Mind sharing some examples of bad results you've seen vs other LLMs?

	▲	citizenpaul 3 hours ago \| parent [-]
		1. Seems to forget its context about 20/80 of results now. It used to be decent but now I may make only two prompts and it forgets the previous one noticeably more. 2. Results are noticeably worse, much more prone to "cheating" outcomes like generating some logic then = true to all results so it always finishes regardless of conditions.