There is already ablated Deepseek models out there that will do just that.
https://huggingface.co/NaniDAO/deepseek-r1-qwen-2.5-32B-abla...
https://www.lesswrong.com/posts/jGuXSZgv6qfdhMCuJ/refusal-in...