so how does LLM moderation work now on all the major chatbots? they refuse prompts that are against their guidelines, right?
Sometimes. That's the whole problem, in short.