Remix.run Logo
Ask HN: How do you avoid / get out of LLMs local minima?
3 points by d--b 14 hours ago | 3 comments

When using LLMs, I have the impression that they often stick to an idea which they dont seem to be able to shake off, no matter how much arguing I can put in, and however strongly I feel about it.

What's your trick to get the LLM out of its own beliefs ?

I often find myself having to kill the entire context, which is obviously not the best...

skvark an hour ago | parent | next [-]

Having a separate planning / research phase helps with this. Make the LLM curate a plan by gathering internal and external context. Then execute the plan in another fresh session. Of course if the planning phase itself ends up in the local min then I would just start a new planning session with the learnings.

Jimmc414 7 hours ago | parent | prev | next [-]

This is conversational momentum due to the autoregressive nature of models. Each token is drawn from a probability distribution conditioned on the preceding tokens. The best method I know of working around this is to request and curate a markdown export package from a conversation that is used to prime a new conversation in a clean context.

uberman 14 hours ago | parent | prev [-]

I have found that in such situations all one needs to do is ask "are you sure x is y" atleast as far as most Claude models go. That usually results in and apology and and escape from your local min.