cpursley 6 hours ago

It's common to do a hybrid of BM25 with other fuzzy search or pgvector.
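For anyone wondering what such a hybrid might look like, here is a rough sketch. It scores documents with a plain Okapi BM25 and then merges that ranking with a second one using reciprocal rank fusion (RRF). RRF is just one common way to combine rankings and is my assumption, not something specified above. The function names, the `k1`, `b`, and `k` defaults, and the toy data are all illustrative, and the second ranking stands in for whatever a pgvector similarity query would return.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the query with Okapi BM25."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n               # average document length
    df = Counter(t for d in docs for t in set(d))       # document frequency per term
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

def ranking(scores):
    """Document indices ordered from best to worst score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

def rrf(rankings, k=60):
    """Reciprocal rank fusion: sum 1 / (k + rank) over every input ranking."""
    fused = Counter()
    for r in rankings:
        for rank, doc_id in enumerate(r, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=lambda d: fused[d], reverse=True)

# Toy corpus, already tokenized (hypothetical data).
docs = [
    ["postgres", "full", "text", "search"],
    ["vector", "similarity", "search", "with", "pgvector"],
    ["cooking", "pasta"],
]
lexical = ranking(bm25_scores(["pgvector", "search"], docs))
vector = [1, 2, 0]  # placeholder for an ordering from a vector similarity query
print(rrf([lexical, vector]))
```

RRF only looks at ranks, so the BM25 scores and the vector distances never have to be put on the same scale, which is probably why it is a popular default for this kind of setup.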

storus 5 hours ago

BM25 is quite bad and has to be refit for each new corpus, since its IDF and average-document-length statistics are corpus-specific. SPLADEv2 is much better, and there are even better sparse embeddings these days.