| ▲ | RandomBK 2 hours ago | ||||||||||||||||||||||
I've found swearing at a model to be quite effective in getting it to rethink and correct its mistakes. This seems to apply across Codex, Claude, Qwen, and Gemma/Gemini. I don't know if the model is picking up on a "need to lock in and be more rigorous" signal, or if the model providers are routing to smarter models if they detect a frustrated user. But if a model keeps making the same mistakes, swearing at it often helped kick it out of a glut and onto the right track. Or it could just be catharsis. | |||||||||||||||||||||||
| ▲ | alentred 2 hours ago | parent | next [-] | ||||||||||||||||||||||
Reminds me of this study: https://arxiv.org/pdf/2510.04950 . It demonstrates that being "rude" or "very rude" increases the accuracy of the results. A dubious but very fun read. The prompts in Table 1 (top of page 3) are awesome. I am sure they tried other prompts, but didn't include them to the paper. | |||||||||||||||||||||||
| ▲ | layer8 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
I would prefer not having to get into a habit that might bleed into non-LLM interactions. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | mchinen an hour ago | parent | prev | next [-] | ||||||||||||||||||||||
This is interesting, because in the leaked code, it was found that they detected simple swearing keywords for analytics that get sent to Anthropic, but also had directions to keep the behavior the same for claude. I also have the feeling a 'wtf' does something, but it does feel good and might just be placebo, because 'that is still wrong' sometimes works the 4th time too. Or maybe they changed something. | |||||||||||||||||||||||
| ▲ | anonzzzies 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
I notice the same. Like you I am not even sure if it really helps, however, every day I find occasions where I see Opus will never do it correctly even though I calmly explain; swearing then suddenly fixes it. I had some issue yesterday where opus kept blaming the api for not sending some field while I knew it was there ; I showed it json, logs etc but it kept repeating that there must have been a glitch; frustration built, I called it all kinds of things in one sentence and the next solution was the right one. This after 10 similar misguesses. It was one of those increasingly rare cases where I should have just done it myself, but I can never know going in how stubborn it will be in continue blaming the (obviously) wrong thing. The around 11 prompts to get to the answer were in a /clear opus 4.7 context (1m) on xhigh. | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | morpheuskafka an hour ago | parent | prev | next [-] | ||||||||||||||||||||||
Wasn't it posted a few weeks ago that the frontend code for Claude or maybe Gemini or one of them had a swearing-at-model classifier that passed a flag to the backend? (Not sure why it was even done in frontend, but it was.) | |||||||||||||||||||||||
| ▲ | nathanmills 2 hours ago | parent | prev | next [-] | ||||||||||||||||||||||
Whenever I throw slurs at them they just refuse to respond | |||||||||||||||||||||||
| |||||||||||||||||||||||
| ▲ | arcanemachiner an hour ago | parent | prev | next [-] | ||||||||||||||||||||||
Personally, I have found that Claude absolutely shits the bed if I am rude to it like that. Qwen seems to handle it okay, though, and will course-correct when encouraged with excessive profanity. | |||||||||||||||||||||||
| ▲ | dugmartin 2 hours ago | parent | prev [-] | ||||||||||||||||||||||
I've found a mix of peppered in upper case words where you are effectively yelling at the LLM also gives it a strong signal. It is also a bit cathartic. | |||||||||||||||||||||||