Remix.run Logo
mpeg 2 hours ago

It's not even very usable... I tried 2 different chats and both eventually got stopped due to the safeguards

One was a piece of code I gave it to improve, it did so and then started writing tests, some of which tested security so the safeguards triggered

Another was one of the cryptography puzzles I use as new model tests, which are hard to oneshot and there's no public solution anywhere, it completely refused to even try to solve it

gavinray 37 minutes ago | parent | next [-]

I tried 2 chats and it declined both.

- 1st chat asked about a minor shoulder injury most likely mechanisms

- 2nd chat asked about optimal bloodwork testing markers

kranke155 24 minutes ago | parent [-]

it seems to dislike biological chats. Rejected me on a chat that I am running with 4.8 as well on a rare condition I have.

CSSer 39 minutes ago | parent | prev | next [-]

Oh joy. A model whose safeguards make it prone towards code that make your systems less safe. How brilliant!

Erem 2 hours ago | parent | prev [-]

So the degradation to Opus 4.8 from the article isn't happening in practice?

mtkd an hour ago | parent | next [-]

No, you get a AUP violation and have to manually swap the model

(I had same issue, just asked it to check some code that 4.8 had modified earlier in day)

andai an hour ago | parent | prev [-]

Maybe that's only in the chat UI, and not the API?