Great read. It makes you wonder how heavily optimised the tokenizers used by popular search enginea truly are.