▲ | Der_Einzige 3 days ago | |
Funny, I use Mistral because it has 'more" of that same factor, even in the name! They're the only company who doesn't lobotomize/censor their model in the RLHF/DPO/related phase. It's telling that they, along with huggingface, are from le france - a place with a notably less puritanical culture. | ||
▲ | falseAss 2 days ago | parent [-] | |
do you feel the less censorship yourself from their instruction tuned model, or is there some public reference to showcase? (i haven't used mistral model before). It's interesting if a major llm player adopt a different safety / alignment goal. |