Remix.run Logo
furyofantares 4 hours ago

I find 4.6 pretty noticeable upgrade, but it might be the 1M context. I'm interested in how the 1M context works out with Qwen.

Alifatisk 2 hours ago | parent | next [-]

From Qwen-3-max thinking, I remember the inference becoming veeery slow as you pushed towards 1M context, already at 300k tokens you would notice the degradation. But of course, I was using Qwen Chat, so could be a resource allocation thing.

nwienert 4 hours ago | parent | prev [-]

I found it worse, in a very clear way.