arnorhs 4 hours ago

That's an interesting story, but I'm really at a loss for how this relates to the post you are commenting on.

sminchev an hour ago

Thank you for your question. At least one person shared an opinion when down-voting. That is good, because now I know what I did wrong, and I highly respect any respectable feedback ;)

I really hate it when people down-vote without giving any feedback about what they don't like.

Levenshtein distance, in combination with machine learning and big-data engines like Apache Spark, can do a good job of comparing content as well ;)

I wanted to share another approach and some ideas with people who are interested in comparing strings, doing fuzzy searches, and searching for duplicated content.

devmor an hour ago

I think they just forgot to link their train of thought. I have also used Levenshtein distance for deduplication comparisons so I can guess where the story came from.
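
For anyone who hasn't run into it, here is a rough sketch of the idea in Python. It is only an illustration, not what either of us actually ran: the `is_near_duplicate` helper and its 0.2 threshold are assumptions made up for this example, and a real pipeline would tune the threshold and probably normalize the text first.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of single-character inserts,
    deletes, and substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the rows as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def is_near_duplicate(a: str, b: str, threshold: float = 0.2) -> bool:
    """Hypothetical helper: treat two strings as duplicates when the edit
    distance is at most `threshold` of the longer string's length.
    The 0.2 default is an arbitrary assumption, not a recommended value."""
    longest = max(len(a), len(b))
    if longest == 0:
        return True  # two empty strings are trivially identical
    return levenshtein(a, b) / longest <= threshold


print(levenshtein("kitten", "sitting"))
print(is_near_duplicate("hello world", "hello wrld"))
```

Dividing by the longer length keeps the threshold comparable between short and long strings. Comparing every pair this way is quadratic in the number of items, which is presumably where something like Spark comes in for large datasets.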