nojvek 2 days ago

There’s also vss, a DuckDB extension that builds an HNSW index.

https://github.com/duckdb/duckdb-vss

Since DuckDB is already columnar, it goes brrrrr, with single-digit-millisecond vector similarity lookups.

roflcopter69 2 days ago

Okay, bummer. No support for quantized datatypes yet, and from the docs I cannot see anything that mentions fast brute-force search. I personally don't need an index. But I see that https://github.com/unum-cloud/usearch, which is used by duckdb-vss, in turn uses https://github.com/ashvardanian/simsimd, which should make really fast exact vector similarity search possible. Am I missing something here?
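
For anyone unsure what "exact" or "brute force" search means here: it is a full scan that scores the query against every stored vector and keeps the best k, with no index involved. Below is a minimal pure-Python sketch of that idea, assuming cosine similarity as the metric. The function names and the toy data are hypothetical, and this is not how duckdb-vss, usearch, or simsimd implement it; those libraries would do the same scan with SIMD kernels rather than Python loops.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def brute_force_top_k(query, vectors, k):
    """Exact search: score every stored vector and keep the k best.

    Returns (row_index, similarity) pairs, most similar first.
    """
    scored = ((i, cosine_similarity(query, v)) for i, v in enumerate(vectors))
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Hypothetical toy data, not taken from any benchmark.
vectors = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 0.0, 1.0],
]
top = brute_force_top_k([1.0, 0.0, 0.0], vectors, k=2)
```

Because every row is scored, the result is exact by construction. The cost is linear in the number of vectors, which is the trade-off an approximate index like HNSW is meant to avoid.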

roflcopter69 2 days ago

Oh right, DuckDB being columnar is the ultimate brrr factor for such a brute-force vector similarity search. But doesn't using an HNSW index completely forfeit this potential advantage?