▲ | swores a day ago | |
Copied from the paper (halfway down page 6: https://arxiv.org/pdf/2508.15260 ) > "Specifically, DeepConf-low uses top η= 10% (corresponding to the 90th percentile) and DeepConf-high uses top η = 90% (corresponding to the 10th percentile) uniformly across all settings. This threshold ensures that during online generation, traces are terminated when their confidence falls below the level that retains the top η% highest-confidence traces from the warmup phase." I'm not sure if I'm parsing it right, but are they using "low" and "high" as descriptors of the number used as the %, meaning that the "low" 10 cuts anything outside the best 10%, while the "high" 90 leaves the best 90% ie high is less selective than low? | ||
▲ | carbocation a day ago | parent [-] | |
Thanks, this is a helpful breakdown. |