Remix.run Logo
smith7018 7 hours ago

Do you have benchmarks or at least anecdotes to back that up? I'm not arguing with you; I would just love to see some proof that open models are getting as good as Anthropic's models.

redox99 6 hours ago | parent | next [-]

I've been running some test prompts comparing frontier models for webdev, particularly pretty visualizations, physics / orbital simulations, etc.

Do note that GLM is not multi modal, which can be a deal breaker. And these open models are not good outside coding.

unrvl22 6 hours ago | parent | prev [-]

look at benchmarks, use the model yourself. Im usually first to call BS on every chinese model that says they are as good as Opus. this is finally the first one that actually is. It is a massive jump from every other previous chinese model.

smith7018 6 hours ago | parent [-]

"use the model yourself"

I wish I had the time to set it up and work on side projects but unfortunately life and work have been crazy (as I'm sure many here feel). That's why I asked for anecdotes about it.