| ▲ | Haydex: From Zero to 178.6B rows a second in 30 days(axiom.co) |
| 26 points by pdubroy 8 hours ago | 15 comments |
| |
|
| ▲ | jacquesm 6 hours ago | parent | next [-] |
| That looks like a variation on a Bloom filter to me. https://en.wikipedia.org/wiki/Bloom_filter In the 80's or so when I thought I was being really clever I came up with another variation on this and I recall being quite annoyed when someone on HN pointed out (many years later) that this was a staple of computing science for longer than that I had been busy with computers. So much for having original thoughts... |
| |
| ▲ | tsenart 6 hours ago | parent | next [-] | | Author here, indeed a variation of bloom filters: https://x.com/lemire/status/1971279371131646063 | | |
| ▲ | jacquesm 4 hours ago | parent [-] | | Ok. I have blocked X at the router level here since Elon went certifiable so I can't read that link but I will happily take your word for it. |
| |
| ▲ | teaearlgraycold 6 hours ago | parent | prev [-] | | It does go to show that a huge number of inventions we consider foundational are really just from a normal person being in the right place at the right time. When a field is emerging there is a lot of low hanging fruit you can get your name stamped upon. |
|
|
| ▲ | alexfromapex 7 hours ago | parent | prev | next [-] |
| I usually just call it 178 billion |
| |
|
| ▲ | mrbluecoat 6 hours ago | parent | prev | next [-] |
| Same EventDB as https://github.com/ahri/eventdb or proprietary? |
| |
|
| ▲ | dmitrygr 7 hours ago | parent | prev | next [-] |
| 178.6e9rows/s/30days = 66150rows/s^2 |
|
| ▲ | timhigins 6 hours ago | parent | prev | next [-] |
| This kind of reads like an action or war novel |
| |
| ▲ | twoodfin 5 hours ago | parent [-] | | As edited by ChatGPT… | | |
| ▲ | rhaps0dy 5 hours ago | parent [-] | | Yeah, it's very clearly LLM-edited, but it's fun to read. The LLM did a good job. It's not just a tech blog post - it's a thriller. ;) | | |
| ▲ | smartbit 31 minutes ago | parent [-] | | Write an adventure where we implemented Bloofi Multidimensional Bloom Filters from this 2015 article https://arxiv.org/abs/1501.01941. At the end mention the second author, don’t mention the algorithm nor that it is based on the 1970 Bloom-filter algorithm https://en.wikipedia.org/wiki/Bloom_filter. Make me the main character that did all the hard work and caused our customer to win. The adventure should be some 5000 words long and use each of these words 20-30 times: data, axiom, filter, query, hashcolumn, haydex. |
|
|
|
|
| ▲ | ccleve 7 hours ago | parent | prev [-] |
| 178 billion? That's nothing. I did trillions just this morning. I went to the grocery store and picked an item off the shelf, effectively filtering out the trillions of other products that I could have picked but didn't. They did not process 178 billion rows per second. They did a search that found something in a large data set by eliminating the parts of the data set that could not have contained the item. Same way I did by picking one grocery store and going straight to the shelf. |
| |
| ▲ | sally_glance 5 hours ago | parent [-] | | Hm, if I understand their product correctly they are building a DB and their filtering actually returns correct results. So, the analogy doesn't really hold true unless you actually have these trillions of alternate products stored in your brain and manage to cite the matching subset on demand. |
|