| ▲ | roxolotl 15 hours ago | ||||||||||||||||||||||||||||
I’m very curious if we’re going to ever get another “deepseek moment. Qwen is starting to feel like it could be one. But for it to be people would have to decide to care. It took about a month, I think mid December-mid January, from the deepseek paper for the “moment” so it doesn’t necessarily have to be right away. | |||||||||||||||||||||||||||||
| ▲ | try-working 15 hours ago | parent [-] | ||||||||||||||||||||||||||||
What's gone unnoticed with the Gemma 4 release is that it crowned Qwen as the small model SOTA. So for the first time a Chinese lab holds the frontier in a model category. It is a minor DeepSeek model, because western labs have to catch up with Alibaba now. | |||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||