Madmallard 2 hours ago

"Grok 4 Heavy wasn't considered in comparisons. Grok meets or exceeds the same benchmarks that Gemini 3 excels at, saturating MMLU, scoring highest on many of the coding-specific benchmarks. Overall better than Claude 4.5, in my experience, not just with the benchmarks."

I think these types of comments should just be forbidden from Hacker News.

It's all feelycraft and impossible to distinguish from motivated speech.