Can you suggest a good minhash implementation?

太阳男子 2021-01-31 10:50

I am looking for an open-source MinHash implementation that I can use in my work.

The functionality I need is very simple, given a set as input, the impleme

4 answers
  •  无人及你
    2021-01-31 11:38

    Have a look at the following open-source libraries, in this order. All of them are written in Python and show how to calculate document similarity using LSH/MinHash:

    lsh
    LSHHDC : Locality-Sensitive Hashing based High Dimensional Clustering
    MinHash
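
    If the requirement really is just "set in, signature out", a standard-library-only sketch may be enough to experiment with before adopting one of the libraries above. The code below is not taken from any of them. The function names (`make_hash_params`, `minhash_signature`, `estimate_jaccard`), the SHA-1 base hash, and the default of 128 hash functions are all assumptions made for illustration.

```python
import hashlib
import random

# A Mersenne prime larger than any 32-bit base hash value (assumed modulus).
_PRIME = (1 << 61) - 1


def _base_hash(item):
    """Map an item to a stable 32-bit integer via SHA-1 of its string form."""
    digest = hashlib.sha1(str(item).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def make_hash_params(num_perm=128, seed=1):
    """Draw (a, b) pairs for hash functions of the form (a*x + b) mod _PRIME."""
    rng = random.Random(seed)
    return [(rng.randrange(1, _PRIME), rng.randrange(0, _PRIME))
            for _ in range(num_perm)]


def minhash_signature(items, params):
    """Return the MinHash signature of a non-empty set: one minimum per hash."""
    hashes = [_base_hash(x) for x in set(items)]
    if not hashes:
        raise ValueError("cannot compute a MinHash signature of an empty set")
    return [min((a * h + b) % _PRIME for h in hashes) for a, b in params]


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching positions."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


if __name__ == "__main__":
    params = make_hash_params(num_perm=128, seed=1)
    doc_a = {"the", "quick", "brown", "fox"}
    doc_b = {"the", "quick", "red", "fox"}
    sig_a = minhash_signature(doc_a, params)
    sig_b = minhash_signature(doc_b, params)
    # The true Jaccard similarity here is 3/5; the estimate is approximate.
    print(estimate_jaccard(sig_a, sig_b))
```

    Both signatures must be built with the same `params`, otherwise the positions are not comparable. More hash functions give a tighter estimate at the cost of a longer signature.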
