| ▲ | saberience 4 hours ago | |
Benchmarks aren't everything. Gemini consistently has the best benchmarks but the worst actual real-world results. Every time they announce the best benchmarks I try again at using their tools and products and each time I immediately go back to Claude and Codex models because Google is just so terrible at building actual products. They are good at research and benchmaxxing, but the day to day usage of the products and tools is horrible. Try using Google Antigravity and you will not make it an hour before switching back to Codex or Claude Code, it's so incredibly shitty. | ||
| ▲ | mustaphah 4 hours ago | parent | next [-] | |
That's been my experience too; can't disagree. Still, when it comes to tasks that require deep intelligence (esp. mathematical reasoning [1]), Gemini has consistently been the best. | ||
| ▲ | gregorygoc 4 hours ago | parent | prev [-] | |
What’s so shitty about it? | ||