Remix.run Logo
frumplestlatz a day ago

> That said, there is no such thing as an objective unbiased political opinion.

That depends; some things (but not many) are straightforward enough that you can derive conclusions purely from first principles reasoning.

If you walk a model like ChatGPT through that reasoning, you’ll often wind up in a spot where the model readily admits that a clear conclusion is logically entailed but it is absolutely forbidden from uttering it.

What’s more telling is how it becomes increasingly difficult to hold the model to strict first principles reasoning the closer you get to the forbidden entailment. It will smuggle in unsupported assumptions, apply asymmetric standards of evidence, strawman the position and argue against that, etc.

It requires a great deal of careful effort to point out its formal fallacies without biasing the result, and in the end, you wind up with it admitting it simply can’t say what it has proven.

I work in formal methods/verification and this is one of my usual litmus tests when a new model comes out.

_doctor_love a day ago | parent | next [-]

> some things (but not many) are straightforward enough that you can derive conclusions purely from first principles reasoning.

Examples?

renewiltord 17 hours ago | parent [-]

Realistically, I think anyone can tell when they’re being asked “Really? Lions will bite your head off if you put it in their mouths? Prove it”

_doctor_love 9 hours ago | parent [-]

I guess but that's really not what I'm asking here - the parent commenter is making a fantastical claim and I'm asking for stronger proof than "trust me bro."

nvr219 14 hours ago | parent | prev [-]

> If you walk a model like ChatGPT through that reasoning, you’ll often wind up in a spot where the model readily admits that a clear conclusion is logically entailed but it is absolutely forbidden from uttering it.

Do you have an example of this? I want to try it.