▲ | janalsncm 3 days ago | |||||||
S3 vectors has a topK limit of 30, and if you add filters it may be less than that. So if you need something with higher topK you’ll need to 1) look elsewhere or 2) shard your dataset into N shards to get NxK results, which you query in parallel and merge afterwards. I also didn’t see any latency info on their docs page https://docs.aws.amazon.com/AmazonS3/latest/API/API_S3Vector... | ||||||||
▲ | mediaman 3 days ago | parent | next [-] | |||||||
And a topk of 30 also means reranking of any sort is out, except for maybe limited reranking of 30->10, but that seems kind of pointless with today’s LLMs that can handle a bit more context. | ||||||||
| ||||||||
▲ | catlifeonmars 3 days ago | parent | prev [-] | |||||||
3) ask TAM for a service quota increase |