Can you suggest a good minhash implementation?

太阳男子 2021-01-31 10:50

I am trying to look for a minhash open source implementation which I can leverage for my work.

The functionality I need is very simple, given a set as input, the impleme
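For orientation, here is a minimal sketch of the MinHash technique itself in plain Python, using only the standard library. The function names `minhash_signature` and `estimate_jaccard`, the choice of `blake2b` as the hash family, and the default of 64 hash functions are all assumptions made for illustration; this is not the API of any particular library.

```python
import hashlib


def minhash_signature(items, num_perm=64):
    """Build a MinHash signature: one minimum hash value per seeded hash function.

    `items` is an iterable of strings (hypothetical input format).
    """
    items = list(items)
    if not items:
        raise ValueError("cannot build a signature for an empty set")
    signature = []
    for seed in range(num_perm):
        # Each seed acts as a different hash function via blake2b's salt.
        salt = seed.to_bytes(8, "big")
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)
```

Two sets with the same elements always produce the same signature, so their estimate is 1.0; for other pairs the estimate only approximates the true Jaccard similarity, and the approximation tightens as `num_perm` grows.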

4 answers
  •  滥情空心
    2021-01-31 11:24

    I would suggest this library, especially if you need persistence: it can use Redis to store and retrieve all your data.

    You can either select a Redis database or simply use the built-in, in-memory Python dictionaries.

    Performance with Redis, at least when the Redis server runs on your local machine, is almost identical to what you get with standard Python dictionaries.

    You only need to specify a config dictionary such as

    config = {"redis": {"host": 'localhost', "port": '6379', "db": 0}}
    

    and pass it as an argument to the LSHash class constructor.
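The choice between Redis and in-memory storage described above can be sketched as a small helper. The function `make_storage_config` is hypothetical and not part of the library. Returning `None` for the in-memory case assumes the library then falls back to its default dictionaries, and the exact constructor argument that receives the config is not shown in this answer, so check the library's documentation before relying on either.

```python
def make_storage_config(use_redis, host="localhost", port="6379", db=0):
    """Build a config dictionary in the shape shown in the answer.

    Hypothetical helper: 6379 is Redis's default port; pass whatever
    port your server actually listens on.
    """
    if use_redis:
        return {"redis": {"host": host, "port": port, "db": db}}
    # Assumption: no config means the library uses in-memory Python dicts.
    return None


# Persistent storage on a local Redis server, database 0.
redis_config = make_storage_config(use_redis=True)

# In-memory storage, nothing survives a restart.
memory_config = make_storage_config(use_redis=False)
```

Keeping the choice in one place makes it easy to run with in-memory storage during tests and switch to Redis only where persistence matters.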
