| ▲ | zardo 4 hours ago |
| I'm wondering how much the output quality of a small model could be boosted by taking multiple goes at it. Generate 20 answers and feed them back through with a "rank these responses" prompt. Or doing something like MCTS. |
|
| ▲ | freakynit 3 hours ago | parent [-] |
| Isn't this what thinking models do internally? Chain of thoughts? |
| |
| ▲ | andy12_ 3 hours ago | parent [-] | | No. Chain of thought it just the model generating a single answer for longer inside <think></think> tags which are not shown in the final response. The strategy of generating different answers in parallel is something different (which can be used in conjunction with chain of thought) and is the thing used by models like Gemini 3 Deep Think and GPT-5.2 Pro. | | |
|