Terr_ 6 days ago

A regular "chatting" LLM is a document generator incrementally extending a story about a conversation between a human and a robot... And through that lens, I've been thinking "chain of thought" seems like basically the same thing but with a film noir styling-twist.

The LLM is trained to include an additional layer of "unspoken" text in the document, a source of continuity which substitutes for how the LLM has no other memories or goals to draw from.

"The capital of Assyria? Those were dangerous questions, especially in this kind of town. But rent was due, and the bottle in my drawer was empty. I took the case."

areeh 6 days ago

Oh wow, now I want a chain-of-thought rewriter that makes the chat and the CoT, put together, follow this style.

Terretta 6 days ago

Click "show details" in ChatGPT deep research for such a tale, except instead of noir it's "Office Space".

Terr_ 5 days ago

[P.S.] I realize I left off an important part: If the regular version did not already "reason"... Why would we ever expect this kind of tweak to change that, and bring forth real reasoning?

The core algorithm hasn't really changed; we're just changing the (hidden) document so that it's a different style with a greater density of clues, so that it can more effectively bullshit [0] output that humans won't notice and dislike.

[0] Creating something that "sounds good" without any particular awareness or care about truth or falsehood.

RainyDayTmrw 5 days ago

The steelman argument - not to say that I agree with it - is that this capability was present all along, but due to variability in the behavior of the model, adding certain context, such as chain-of-thought, exposes it.