all2 2 days ago
If we treat LLM output like a manufacturing output, then if you have three 80% probabilities you actually have something like 0.8 × 0.8 × 0.8 = 0.512, or about 51%.
scottmu 2 days ago
Yes, there's a wide variety of use cases that require different ratios of accuracy/speed. If you require 3 responses to be accurate, you have to multiply all 3 response accuracy probabilities, and as you've shown, this can reduce overall accuracy quite a bit. Of course, this does make the assumption that those 3 responses are independent of one another.
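
For anyone who wants to play with the numbers, here is a rough Python sketch of that multiplication. It only holds under the independence assumption mentioned above, and the function names `joint_accuracy` and `steps_until_below` are made up for illustration, not taken from any library.

```python
import math


def joint_accuracy(step_accuracies):
    """Probability that every response is correct.

    Assumes the responses are independent, so the per-response
    accuracies can simply be multiplied together.
    """
    return math.prod(step_accuracies)


def steps_until_below(per_step, threshold):
    """Smallest number of independent responses whose joint accuracy
    falls below `threshold`, given the same accuracy for each response.
    """
    if not (0.0 < per_step < 1.0) or not (0.0 < threshold <= 1.0):
        raise ValueError("per_step must be in (0, 1) and threshold in (0, 1]")
    count = 0
    joint = 1.0
    while joint >= threshold:
        joint *= per_step
        count += 1
    return count


# Three responses at 80% each, as in the comment above.
print(round(joint_accuracy([0.8, 0.8, 0.8]), 3))

# How many 80% responses can be chained before the joint accuracy
# drops below a coin flip?
print(steps_until_below(0.8, 0.5))
```

If the responses are correlated (for example, they share the same prompt or the same failure mode), the true joint accuracy can be higher or lower than this product, so treat the result as a back-of-the-envelope estimate.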