Remix.run Logo
kelnos 6 days ago

Part of the problem to me is that these models are so damned agreeable. I haven't used ChatGPT in a while, but Claude is always assuming I'm right whenever I question something. I have to explicitly tell it not to assume I'm right, and to weigh my question with what it suggested. Maybe if they were trained to treat questions more skeptically, this kind of thing wouldn't happen.

And they're so "friendly"! Maybe if they weren't so friendly, and replied a little more clinically to things, people wouldn't feel so comfortable using them as a poor substitute for a therapist.

teaearlgraycold 6 days ago | parent [-]

I really want the LLMs to respond like a senior developer that doesn't have time for you but needs you to get your job done right. A little rude and judgemental, but also highly concise.

blackqueeriroh 6 days ago | parent [-]

You say that now, but how they actually behave says that you’d probably get tired of it.

teaearlgraycold 6 days ago | parent [-]

I’m not the one providing the RLHF feedback. They’re optimized for the lowest common denominator.