Remix.run Logo
tebeka 12 hours ago

https://duckdb.org/2024/05/03/vector-similarity-search-vss

jlarks32 6 hours ago | parent | next [-]

+1 on this one, I've been pleasantly surprised by this for a small (<3GB) local project

m00dy 10 hours ago | parent | prev [-]

does duckdb scale well over large datasets for vector search ?

lgrebe 10 hours ago | parent [-]

What order of magnitude would you define as „large“ in this case?

m00dy 9 hours ago | parent [-]

like over 1tb.

cess11 8 hours ago | parent [-]

Some people are using DuckDB for large datasets, https://duckdb.org/docs/stable/guides/performance/working_wi... , but you'd probably do some testing under the specific conditions of your rig to figure out if it is a good match or not.