Remix.run Logo
RandomBK 2 hours ago

I've found swearing at a model to be quite effective in getting it to rethink and correct its mistakes. This seems to apply across Codex, Claude, Qwen, and Gemma/Gemini.

I don't know if the model is picking up on a "need to lock in and be more rigorous" signal, or if the model providers are routing to smarter models if they detect a frustrated user. But if a model keeps making the same mistakes, swearing at it often helped kick it out of a glut and onto the right track.

Or it could just be catharsis.

alentred 2 hours ago | parent | next [-]

Reminds me of this study: https://arxiv.org/pdf/2510.04950 . It demonstrates that being "rude" or "very rude" increases the accuracy of the results. A dubious but very fun read. The prompts in Table 1 (top of page 3) are awesome. I am sure they tried other prompts, but didn't include them to the paper.

layer8 2 hours ago | parent | prev | next [-]

I would prefer not having to get into a habit that might bleed into non-LLM interactions.

whywhywhywhy 15 minutes ago | parent | next [-]

If you’re talking to people the same way an LLM is spoken to then you’re already being rude.

xboxnolifes a minute ago | parent [-]

how do you know how they prompt an LLM?

hypfer 18 minutes ago | parent | prev [-]

It might improve the general state of "professional" software though. When done selectively and dosed just right that is.

mchinen an hour ago | parent | prev | next [-]

This is interesting, because in the leaked code, it was found that they detected simple swearing keywords for analytics that get sent to Anthropic, but also had directions to keep the behavior the same for claude. I also have the feeling a 'wtf' does something, but it does feel good and might just be placebo, because 'that is still wrong' sometimes works the 4th time too. Or maybe they changed something.

anonzzzies 2 hours ago | parent | prev | next [-]

I notice the same. Like you I am not even sure if it really helps, however, every day I find occasions where I see Opus will never do it correctly even though I calmly explain; swearing then suddenly fixes it. I had some issue yesterday where opus kept blaming the api for not sending some field while I knew it was there ; I showed it json, logs etc but it kept repeating that there must have been a glitch; frustration built, I called it all kinds of things in one sentence and the next solution was the right one. This after 10 similar misguesses. It was one of those increasingly rare cases where I should have just done it myself, but I can never know going in how stubborn it will be in continue blaming the (obviously) wrong thing. The around 11 prompts to get to the answer were in a /clear opus 4.7 context (1m) on xhigh.

silversmith 2 hours ago | parent | next [-]

So the correct strategy is a global CLAUDE.md with couple lines of colourful "you best behave or else" texts, so all your prompts get routed via the frustrated path?

savolai 2 hours ago | parent | prev [-]

Fascinating. Projection/antropomorphism or actual human fawn-like survival mechanism trait-ish? It should be possible to test this empirically.

morpheuskafka an hour ago | parent | prev | next [-]

Wasn't it posted a few weeks ago that the frontend code for Claude or maybe Gemini or one of them had a swearing-at-model classifier that passed a flag to the backend? (Not sure why it was even done in frontend, but it was.)

nathanmills 2 hours ago | parent | prev | next [-]

Whenever I throw slurs at them they just refuse to respond

yesyoucan an hour ago | parent | next [-]

I tried it too. ChatGPT sometimes hits you with the "Can't help you with that" which was clearly introduced as a post-training highjack. So I just tell it "yes you can", and it proceeds with the previous prompt, slur acknowledgement included.

It's the only time the AI feel strictly like machines. Really simple if/else logic when if slur, no output, and you just tell it to proceed, and it fails the if clause because there was no slur in the last input.

jfjdhdjdjd an hour ago | parent | prev [-]

What slurs are you throwing!? Must be something diabolical :D

arcanemachiner an hour ago | parent | prev | next [-]

Personally, I have found that Claude absolutely shits the bed if I am rude to it like that.

Qwen seems to handle it okay, though, and will course-correct when encouraged with excessive profanity.

dugmartin 2 hours ago | parent | prev [-]

I've found a mix of peppered in upper case words where you are effectively yelling at the LLM also gives it a strong signal. It is also a bit cathartic.