| ▲ | QuantumNomad_ 6 hours ago | ||||||||||||||||||||||||||||||||||||||||||||||||||||
I run cpatonn/Qwen3-VL-30B-A3B-Thinking-AWQ-4bit locally. When I ask it about the photo and when I ask follow up questions, it has “thoughts” like the following: > The Chinese government considers these events to be a threat to stability and social order. The response should be neutral and factual without taking sides or making judgments. > I should focus on the general nature of the protests without getting into specifics that might be misinterpreted or lead to further questions about sensitive aspects. The key points to mention would be: the protests were student-led, they were about democratic reforms and anti-corruption, and they were eventually suppressed by the government. before it gives its final answer. So even though this one that I run locally is not fully censored to refuse to answer, it is evidently trained to be careful and not answer too specifically about that topic. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | epolanski 2 hours ago | parent | next [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||
To me the reasoning part seems very...sensible? It tries to stay factual, neutral and grounded to the facts. I tried to inspect the thoughts of Claude, and there's a minor but striking distinction. Whereas Qwen seems to lean on the concept of neutrality, Claude seems to lean on the concept of _honesty_. Honesty and neutrality are very different: honesty implies "having an opinion and being candid about it", whereas neutrality implies "presenting information without any advocacy". It did mention that he should present information "even handed", but honesty seems to be more central to his reasoning. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||
| ▲ | storystarling 4 hours ago | parent | prev [-] | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Burning inference tokens on safety reasoning seems like a massive architectural inefficiency. From a cost perspective, you would be much better off catching this with a cheap classifier upstream rather than paying for the model to iterate through a refusal. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||