wittlesus 2 hours ago

Genuine question for anyone running in-process vector search in production: when do you reach for something like this vs. an external service?

The appeal of in-process is obvious — no network hop, simpler deployment, lower latency on small-to-medium datasets. But I'm curious about the operational story. How do you handle index updates while serving queries? Is there a write lock during re-indexing, or can you do hot swaps?
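
(For concreteness, this is the kind of hot swap I have in mind — a rough sketch, not any particular library's API, assuming an in-memory NumPy matrix where the replacement index is built off to the side and published with a single reference swap:)

    import threading
    import numpy as np

    class HotSwapIndex:
        # Serve queries from one embedding matrix while a replacement is built elsewhere.
        def __init__(self, embeddings, ids):
            self._current = (embeddings, ids)   # readers always see a complete (matrix, ids) pair
            self._lock = threading.Lock()       # serializes writers; reads never block

        def search(self, query, k=10):
            emb, ids = self._current            # snapshot the pair once per query
            scores = emb @ query                # brute-force dot product (cosine if rows are normalized)
            return [ids[i] for i in np.argsort(-scores)[:k]]

        def swap(self, new_embeddings, new_ids):
            with self._lock:                    # publish the finished index in one assignment
                self._current = (new_embeddings, new_ids)

In CPython that rebind is effectively atomic, so in-flight queries just finish against the old matrix. What I don't know is whether the embedded engines people actually run do something equivalent or take a write lock.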

The mceachen comment about sqlite-vec being brute-force is interesting too. For apps with under ~100K embeddings, does the algorithmic difference even matter practically, or is the simpler deployment story more important than raw QPS?
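
(My back-of-envelope for the brute-force question — a throwaway sketch with random vectors, so the numbers obviously depend on hardware and embedding width, but at 100K rows the whole "index" is one matrix-vector product:)

    import time
    import numpy as np

    # 100K embeddings at 384 dims (MiniLM-sized), float32, L2-normalized so dot == cosine
    N, D = 100_000, 384
    emb = np.random.randn(N, D).astype(np.float32)
    emb /= np.linalg.norm(emb, axis=1, keepdims=True)

    q = np.random.randn(D).astype(np.float32)
    q /= np.linalg.norm(q)

    t0 = time.perf_counter()
    scores = emb @ q                      # full scan over ~150 MB of embeddings
    top = np.argsort(-scores)[:10]
    print(f"full scan: {(time.perf_counter() - t0) * 1000:.1f} ms")

On anything like a modern box that lands in the millisecond range, which is why I suspect the honest answer at this scale is "it doesn't matter."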

yawnxyz an hour ago

useful for adding semantic search to tiny bits of data, e.g. collections of research papers in a folder on my computer, etc.

for web stuff, e.g. community/forums/docs/small sites which usually don't even have 1M rows of data, precomputing embeddings, storing them, and serving them through a small in-process vector search like this is much simpler/cheaper than running external services
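
rough sketch of what I mean by precompute-and-store (assuming sentence-transformers and a plain .npz file on disk; the model name and filenames are just placeholders):

    import numpy as np
    from sentence_transformers import SentenceTransformer  # any embedding model works here

    model = SentenceTransformer("all-MiniLM-L6-v2")

    # one-off step at build/deploy time: embed everything and write it next to the app
    docs = ["first forum post ...", "a docs page ...", "another post ..."]
    emb = model.encode(docs, normalize_embeddings=True)
    np.savez("search_index.npz", emb=emb, docs=np.array(docs, dtype=object))

    # at request time: load once, then every query is one matrix-vector product
    data = np.load("search_index.npz", allow_pickle=True)
    emb, docs = data["emb"], data["docs"]
    q = model.encode(["how do I reset my password"], normalize_embeddings=True)[0]
    print([docs[i] for i in np.argsort(-(emb @ q))[:5]])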

the win is not having to deal with the operational hassle of a dozen+ external services, logins, apis, even if they're free

(I do like Mixedbread for that, but I'd prefer it to be on my own lightweight server or serverless deployment)