ozgrakkurt 3 hours ago
You can actually make those two bits more independent, afaik. https://github.com/apache/parquet-format/blob/master/BloomFi... https://github.com/facebook/rocksdb/blob/main/util/bloom_imp... The first one is useful for grasping the idea; the second one is more comprehensive. Both do multiple bit loads but try to make them as independent as possible, as far as I can understand. Also, the hash function has a huge effect on bloom filter performance. I was getting 2x perf when using xxhash3 instead of wyhash, even though wyhash is the faster hash function afaik.
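To make the "independent bits" idea concrete, here is a rough Python sketch of a split-block bloom filter in the style of the Parquet spec, as I understand it. Each 256-bit block is eight 32-bit words, and each key sets exactly one bit per word, with the bit position derived from a different odd multiplier per word. The class and function names are made up for illustration, the salt constants are quoted from memory of the spec (check them against the source before relying on them), and `blake2b` is only a stdlib stand-in for a real 64-bit hash such as xxhash3.

```python
import hashlib

# Eight odd 32-bit multipliers, one per word of a block.
# Assumption: quoted from memory of the Parquet split-block spec; verify before use.
SALT = (
    0x47B6137B, 0x44974D91, 0x8824AD5B, 0xA2B7289D,
    0x705495C7, 0x2DF1424B, 0x9EFC4947, 0x5C6BFB31,
)
MASK32 = 0xFFFFFFFF


def hash64(data: bytes) -> int:
    """Stand-in 64-bit hash (hypothetical choice; swap in xxhash3 or similar)."""
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "little")


class SplitBlockBloom:
    """Illustrative split-block bloom filter: one 256-bit block per key."""

    def __init__(self, num_blocks: int) -> None:
        self.num_blocks = num_blocks
        # Each block is 8 words of 32 bits.
        self.blocks = [[0] * 8 for _ in range(num_blocks)]

    def _block_index(self, h: int) -> int:
        # The upper 32 bits pick the block via a multiply-shift range reduction.
        return ((h >> 32) * self.num_blocks) >> 32

    @staticmethod
    def _mask(x: int) -> list:
        # The lower 32 bits pick one bit per word: multiply by that word's salt
        # and keep the top 5 bits of the 32-bit product as the bit position.
        return [1 << (((x * s) & MASK32) >> 27) for s in SALT]

    def insert(self, h: int) -> None:
        block = self.blocks[self._block_index(h)]
        for i, bit in enumerate(self._mask(h & MASK32)):
            block[i] |= bit

    def might_contain(self, h: int) -> bool:
        block = self.blocks[self._block_index(h)]
        return all(block[i] & bit for i, bit in enumerate(self._mask(h & MASK32)))
```

Because every key touches all eight words of a single block, a lookup costs one cache line, and the per-word multipliers keep the eight bit positions from being trivially correlated. In a real implementation the eight words map onto one SIMD register, which is where most of the speed comes from; the Python above only shows the bit selection.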