Remix clone Hacker News

	▲	Tinkeringz 6 days ago
		Aren’t all of these reasoning models? Wouldn’t benchmarking OpenAI’s reasoning models against these be the real test of whether Sam is losing?
	▲	maeil 6 days ago
		Sonnet 3.7 non-reasoning is better on its own. In fact, even Sonnet 3.5-v2 is, and that was released 6 months ago. Now, to be fair, they're close enough that there will be use cases, especially non-coding ones, where 4.1 beats it consistently. Also, 4.1 is quite a lot cheaper and faster. Still, OpenAI is clearly behind.
	▲	atemerev 6 days ago
		There is no OpenAI model better than R1, reasoning or not (as confirmed by the same Aider benchmark; non-coding tests are less objective, but I think it still holds). With Gemini (current SOTA) and Sonnet (great potential, but tends to overengineer/overdo things) it is debatable; they are probably better than R1 (and, by extension, than all OpenAI models).
	▲	vitorgrs 6 days ago
		Even without reasoning, isn't DeepSeek V3 from March better?