| ▲ | sanderjd 6 hours ago |
| Right. Opus 4.5, from 8 months ago, was good enough for agentic coding. How far behind that are open weight models? More than 8 months? If so, how much more? When will they reach Opus 4.5 level? A few months from now? A year from now? Never? |
|
| ▲ | theshrike79 5 hours ago | parent | next [-] |
| The power of Opus isn't just in the model, it's in the harness too. You can see this by using Opus through GitHub Copilot vs. the official Anthropic tools. You'll get very different results and a very different experience (in my opinion). |
| |
| ▲ | larsnystrom 4 hours ago | parent | next [-] | | I’ve only used Opus in GitHub Copilot and was hugely underwhelmed. It was barely usable. Are you saying it’s better with the official Anthropic tools? | | |
| ▲ | theshrike79 2 hours ago | parent | next [-] | | Night and day, in my opinion. But this is all purely feels, so YMMV. I especially like how the Claude Code CLI communicates its progress, something they hide a lot more in the desktop app, for example. | |
| ▲ | m-ee 4 hours ago | parent | prev [-] | | I don't know about better, but it's certainly different. It's painfully slow through the Claude Code VS Code extension compared to Copilot, but maybe "smarter": using Sonnet on both, I feel like I have to correct it less. I don't use Opus much because of the cost, but coworkers say the difference between harnesses there is also pronounced. |
| |
| ▲ | throwa356262 2 hours ago | parent | prev [-] | | Open source harnesses are also improving rapidly. Some people would claim they are already far better than Claude Code and Codex. |
|
|
| ▲ | theplumber 6 hours ago | parent | prev | next [-] |
| I think we will have Opus 4.5 performance in open models within the next 6 months. We are very close. |
| |
| ▲ | krzyk an hour ago | parent [-] | | We first need to reach the level of Sonnet 4.x, and we aren't there yet. |
|
|
| ▲ | marak830 5 hours ago | parent | prev [-] |
| GLM 5.2 came out today and the early reports have been quite good. It's very difficult to run on anything less than prosumer hardware, but a small business could do it quite easily (or use something like OpenRouter). |