| ▲ | applfanboysbgon an hour ago | |
> Deep seek 3.2 is 4% on Arc-AGI 2 Why are you bringing up an outdated Chinese model from 6 months ago to compare to a US model from 6 months ago? The outdated Chinese model will have performance from ~12 months ago, obviously. But today's Chinese model DeepSeek 4 has performance not far from the US model 6 months ago; 46% compared to 52% from 5.2. | ||
| ▲ | gpt5 10 minutes ago | parent [-] | |
Because Deepseek 4.0 is not yet there, but the jump isn't expected to be large. Kimi 2.5 is there and is also scoring low. | ||