I'll diverge from some of these comments, I don't find it misleading to compare to Opus 4.5.

I can remember how good Opus 4.5 was. If I'm considering using this, it's most informative to me to compare to the model it's closest to that I have familiarity with.

I'm obviously not switching to this if I want the best model. I'm switching if I'm hopeful that the smaller versions are close to it, or if I want to have more options for providers, or for any other reasons unrelated to getting the highest quality responses possible.

▲

bensyverson 2 days ago | parent | next [-]

Exactly this. If you can get something close to Opus 4.5 for free, that's noteworthy. I may not use it for the most critical pieces of my app, but not everything I do is galaxy-brain coding.

▲

cmrdporcupine 2 days ago | parent | prev [-]

Yes, honestly, Opus 4.6 and GPT 5.4 were mostly not really noticeable improvements over 4.5 and 5.3 respectively. If we were stuck at 4.5 levels but at 1/10th of the price, I'll take it.

▲

furyofantares 2 days ago | parent [-]

I find 4.6 pretty noticeable upgrade, but it might be the 1M context. I'm interested in how the 1M context works out with Qwen.

	▲	Alifatisk a day ago \| parent \| next [-]
		From Qwen-3-max thinking, I remember the inference becoming veeery slow as you pushed towards 1M context, already at 300k tokens you would notice the degradation. But of course, I was using Qwen Chat, so could be a resource allocation thing.
	▲	nwienert 2 days ago \| parent \| prev [-]
		I found it worse, in a very clear way.