osti 10 hours ago
Given that DeepSwe is one of the very few coding benchmarks worth taking a look at, this achieves a rather excellent result on it (not far from Opus 4.8). From looking at the results and my own impression of 5.1 and other models, I think this is the best Chinese coding model by a not insignificant margin.
LaurensBER 10 hours ago
I've been very pleased with its performance over the last few days. It's definitely not near Opus 4.8 level, but it's very impressive nonetheless, and it does design extremely well.