| ▲ | Madmallard 2 hours ago | |
" Grok 4 Heavy wasn't considered in comparisons. Grok meets or exceeds the same benchmarks that Gemini 3 excels at, saturating mmlu, scoring highest on many of the coding specific benchmarks. Overall better than Claude 4.5, in my experience, not just with the benchmarks." I think these types of comments should just be forbidden from Hacker News. It's all feelycraft and impossible to distinguish from motivated speech. | ||