nostrebored 10 hours ago
That "much smaller number" is the tricky part. Most rerankers degrade substantially in quality over a few hundred candidates. No reranker, however powerful, will make "high powered behavior based models" more effective. Those behavioral signals and intents have to be encoded in the query and the latent space.
janalsncm 7 hours ago
> Most rerankers degrade substantially in quality over a few hundred candidates.

The reason we don’t use the most powerful models on thousands/millions of candidates is latency, not quality. It’s the same reason we use ANN search rather than computing exact cosine sim against every doc in the index.
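To make the latency point concrete, here is a rough sketch of a two-stage pipeline that counts how many times the expensive scorer runs. Everything in it is an assumption for illustration: `CountingReranker` is a hypothetical stand-in (it just reuses cosine similarity where a real reranker would run a heavy model), the first stage uses exact cosine as a placeholder for an ANN index, and the corpus is random vectors. It only shows the cost side of the argument and says nothing about whether quality degrades past a few hundred candidates.

```python
import math
import random


def cosine(a, b):
    """Plain cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class CountingReranker:
    """Hypothetical stand-in for an expensive model; it only counts calls."""

    def __init__(self):
        self.calls = 0

    def score(self, query, doc):
        self.calls += 1
        return cosine(query, doc)  # a real reranker would run a heavy model here


def retrieve_then_rerank(query, docs, reranker, k):
    # Stage 1: cheap candidate generation (exact cosine as a placeholder for ANN).
    candidates = sorted(
        range(len(docs)), key=lambda i: cosine(query, docs[i]), reverse=True
    )[:k]
    # Stage 2: the expensive scorer only sees the k surviving candidates.
    return sorted(candidates, key=lambda i: reranker.score(query, docs[i]), reverse=True)


rng = random.Random(0)
docs = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(5000)]
query = [rng.gauss(0, 1) for _ in range(16)]

two_stage = CountingReranker()
top = retrieve_then_rerank(query, docs, two_stage, k=200)

brute = CountingReranker()
full = sorted(range(len(docs)), key=lambda i: brute.score(query, docs[i]), reverse=True)

# Expensive-model calls: k for the two-stage pipeline, len(docs) for brute force.
print(two_stage.calls, brute.calls)
```

With 5000 docs and k=200, the expensive scorer runs 200 times in the two-stage version and 5000 times in the brute-force version. If each call is a model forward pass, that ratio is what drives the latency budget.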