Aurornis 3 hours ago

Abliteration is a brute-force technique that removes or silences parts of the model. It reduces performance because the abliterated elements aren’t perfectly isolated to censorship, so other aspects suffer.

Many of the “uncensored” model providers also do some fine-tuning on the models. Some of them target better benchmarks or other measures, but outside of the benchmarks and metrics they’re fine-tuned for, they are generally noticeably worse than the original model.
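
For readers unfamiliar with the idea, here is a minimal sketch of the older style of abliteration as it is commonly described: estimate a single "refusal direction" from the difference of mean activations, then project it out of a weight matrix. Everything here is an assumption for illustration. The function names are hypothetical, the activations are synthetic random data rather than anything taken from a real model, and this is not claimed to be what any particular tool does.

```python
import numpy as np

def refusal_direction(harmful_acts, harmless_acts):
    """Unit vector along the difference of mean activations (hypothetical helper)."""
    diff = harmful_acts.mean(axis=0) - harmless_acts.mean(axis=0)
    return diff / np.linalg.norm(diff)

def ablate_direction(weight, direction):
    """Remove the component along `direction` from everything `weight` writes.

    `weight` has shape (d_model, d_in); its outputs live in the same space
    as `direction`. The result equals (I - r r^T) @ weight.
    """
    return weight - np.outer(direction, direction @ weight)

# Synthetic stand-ins for activations collected on two prompt sets.
rng = np.random.default_rng(0)
d_model = 8
harmful = rng.normal(size=(32, d_model)) + 2.0 * np.eye(d_model)[0]
harmless = rng.normal(size=(32, d_model))

r = refusal_direction(harmful, harmless)
W = rng.normal(size=(d_model, d_model))
W_abl = ablate_direction(W, r)

# After ablation, the layer can no longer write anything along r.
x = rng.normal(size=d_model)
leftover = float(r @ (W_abl @ x))
```

The projection removes everything the layer writes along that direction, not just refusal-related signal, which is one way to picture the collateral damage described above.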

yowlingcat 2 hours ago | parent [-]

The kind of abliteration you are describing is no longer state of the art, nor is it the most common way of removing the refusal layer in most models. Your understanding was up to date about a year and a half ago, but it has been out of date since then.

ls612 an hour ago | parent [-]

Nowadays it’s that Heretic tool, is it not? I’ve seen Gemma models uncensored with it.