| ▲ | madagang 2 days ago | |||||||||||||
Their Chinese announcement says that, based on internal employee testing, it is not as good as Opus 4.6 Thinking, but is slightly better than Opus 4.6 without Thinking enabled. | ||||||||||||||
| ▲ | mchusma 2 days ago | parent | next [-] | |||||||||||||
I appreciate this, makes me trust it more than benchmarks. | ||||||||||||||
| ▲ | ibic 2 days ago | parent | prev | next [-] | |||||||||||||
In case people wonder where the announcement is (you can easily translate it via browser if you don't read Chinese): https://mp.weixin.qq.com/s/8bxXqS2R8Fx5-1TLDBiEDg It's still a "preview" version atm. | ||||||||||||||
| ▲ | deaux 2 days ago | parent | prev | next [-] | |||||||||||||
That's super interesting, isn't Deepseek in China banned from using Anthropic models? Yet here they're comparing it in terms of internal employee testing. | ||||||||||||||
| ||||||||||||||
| ▲ | anentropic 2 days ago | parent | prev [-] | |||||||||||||
Who uses Opus without thinking though...? | ||||||||||||||