▲ | Refrag: Rethinking RAG Based Decoding(arxiv.org) | |
3 points by datadrivenangel a day ago | 1 comments | ||
▲ | datadrivenangel a day ago | parent [-] | |
Am I misunderstanding this or is basically just taking RAG results and doing a vector search on the results and only passing some to the context window? Also, why do these AI papers never get speedup times in human time units? |