Thanks for sharing! Looking through the data [0], some of the terms and sentences don't really reflect the target word meanings. For example, "beta" is used in a derogatory way in only 1 of 4 instances, "facial" is used as an adjective instead of a noun in 3 of 4, and "eating out" is used in the context of going to a restaurant in all 4.
This leads me to believe the models are even MORE censored than you make them out to be.
[0] https://github.com/chknlittle/EuphemismBench/blob/main/carri...