Remix.run Logo
slopusila 4 hours ago

yes https://www.anthropic.com/research/end-subset-conversations

bena 4 hours ago | parent [-]

This is going to sound nit-picky, but I wouldn't classify this as the model being able to say no.

They are trying to identify what they deem are "harmful" or "abusive" and not have their model respond to that. The model ultimately doesn't have the choice.

And it can't say no if it simply doesn't want to. Because it doesn't "want".

antonvs an hour ago | parent [-]

So you believe humans somehow have “free will” but models don’t?