LLM backtracking is an active area of research, see e.g.

And I was wrong that nobody has implemented it, as these papers prove people have… it is just the results haven’t been sufficiently impressive to support the transition from the research lab to industrial use - or at least, not yet

▲

measurablefunc 6 days ago | parent | next [-]

> Empirical evaluations demonstrate that our proposal significantly enhances the reasoning capabilities of LLMs, achieving a performance gain of over 40% compared to the optimal-path supervised fine-tuning method.

▲

afiori 6 days ago | parent | prev [-]

I would expect to see something like this soonish as around now we are seeing the end of training scaling and the beginning of inference scaling

	▲	foota 6 days ago \| parent [-]
		This is a neat observation, training has been optimized to hell and inference is just beginning.