embedding-shape 6 hours ago

> Also missing from these discussions are e.g. Qwen, which is at least as good as one back from OpenAI or Anthropic’s frontiers.

They're missing in the discussion because the ones you can run locally, aren't actually "one step away from other closed-source labs" in practice when you use them. They might benchmark as such, but outside of very specific use cases they sadly fall well short of those scores in real use, even when you have, say, 96GB of VRAM available to run the bigger models that most (at-home) consumers won't be able to run.

JumpCrisscross 6 hours ago

> the ones you can run locally, aren't actually "one step away from other closed-source labs"

And they probably won’t be for at least another decade. Comparing like with like, that is, each flagship model running on the best hardware it can run on, Qwen is close.

embedding-shape 6 hours ago

> Qwen is close

I wish so badly this was true, but sadly today it just isn't.

JumpCrisscross 6 hours ago

To be clear, I’m relaying my subjective experience comparing Opus and Qwen.