Remix.run Logo
roxolotl 15 hours ago

I’m very curious if we’re going to ever get another “deepseek moment. Qwen is starting to feel like it could be one. But for it to be people would have to decide to care. It took about a month, I think mid December-mid January, from the deepseek paper for the “moment” so it doesn’t necessarily have to be right away.

try-working 15 hours ago | parent [-]

What's gone unnoticed with the Gemma 4 release is that it crowned Qwen as the small model SOTA. So for the first time a Chinese lab holds the frontier in a model category. It is a minor DeepSeek model, because western labs have to catch up with Alibaba now.

guteubvkk 14 hours ago | parent | next [-]

on my 16 GB GPU Gemma 4 is better and faster than Qwen 3.5, both at 4-bit

so it's not so clear cut

tmikaeld 5 hours ago | parent [-]

depends on usage, Gemma 4 is better on visuals/html/css and language understanding (Which probably plays a role in prompting). But it's worse at code in general compared to Qwen 3.5 27B.

lostmsu 15 hours ago | parent | prev | next [-]

It's unnoticed because it didn't. In Google's own benchmarks they are on par, and I've seen 3rd party benchmarks where Qwen beats G4 with high margin

irishcoffee 12 hours ago | parent | prev [-]

The day a western anything will need to catch up with alibaba will be a notable day indeed. Also, this will never happen.