| ▲ | NekkoDroid 2 hours ago | |||||||
> Try Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC_Q4_k_m.gguf.
This sounds like such a shitpost I initially thought you were joking... but this seems to be a real model???
| ▲ | cpburns2009 19 minutes ago | parent | next [-] | |||||||
There's a method to the madness:
- Mistral-Nemo: the actual model developed by Mistral and Nvidia.
- 2407: likely the release date of the base model, July 2024.
- 12B: the model has 12 billion parameters.
- Thinking: the model operates in thinking mode (it generates a plan for its output and ingests it before producing the actual output).
- Claude-Gemini-GPT5.2: I think this means the model was finetuned with session data from Claude, Gemini, and GPT5.2 to replicate their behavior.
- Uncensored-HERETIC: the model was uncensored using the automated Heretic method.
- Q4_k_m: the model is quantized (lossy compression) to ~5 bpw (bits per weight) from the original 16 bpw.
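To put the quantization numbers in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes exactly 12 billion weights, all stored at the same bit width, and ignores file metadata; the helper name `approx_size_gb` is made up for illustration, and real GGUF files will differ somewhat.

```python
def approx_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough weight size in GB (10^9 bytes), ignoring metadata and mixed-precision tensors."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "12B" taken literally as 12e9 weights; the true count may differ.
N_PARAMS = 12e9

full = approx_size_gb(N_PARAMS, 16)   # original 16 bpw
quant = approx_size_gb(N_PARAMS, 5)   # ~5 bpw, the rough Q4_k_m figure quoted above

print(f"16 bpw: ~{full:.1f} GB")
print(f"~5 bpw: ~{quant:.1f} GB")
```

So under these assumptions the quantized file is roughly a third the size of the 16 bpw weights, which is the main reason a 12B model becomes practical on consumer hardware.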
| ▲ | mring33621 2 hours ago | parent | prev [-] | |||||||
It is! I like to try the variations from possibly 'interesting' people. Some of them are good. Others randomly break into gibberish and Chinese poetry(?).