Remix clone Hacker News

new | show | ask | jobs Github

	▲	applfanboysbgon an hour ago
		> Deep seek 3.2 is 4% on Arc-AGI 2 Why are you bringing up an outdated Chinese model from 6 months ago to compare to a US model from 6 months ago? The outdated Chinese model will have performance from ~12 months ago, obviously. But today's Chinese model DeepSeek 4 has performance not far from the US model 6 months ago; 46% compared to 52% from 5.2.
	▲	gpt5 10 minutes ago \| parent [-]
		Because Deepseek 4.0 is not yet there, but the jump isn't expected to be large. Kimi 2.5 is there and is also scoring low.