| ▲ | magicalhippo 4 hours ago | |
I'm just a casual LLM user, but your description of the anisotropy made me think about the recent work on KV cache quantization techniques such as TurboQuant where they apply a random rotation on each vector before quantizing, as I understood it precisely to make it more isotropic. But for RAG that might be too much work per vector? | ||