mongodb approximate string matching

前端 未结 2 1230
谎友^
谎友^ 2021-01-31 10:06

I am trying to implement a search engine for my recipes-website using mongo db. I am trying to display the search suggestions in type-ahead widget box to the users.

I a

相关标签:
2条回答
  • 2021-01-31 10:24

    The best case for it would be using elasticsearch fuzzy query: https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-fuzzy-query.html

    It supports levenshtein distance algorithm out of the box and has additional features which can be useful for your requirements i.e.: - more like this - powerful facets / aggregations - autocomplete

    0 讨论(0)
  • 2021-01-31 10:43

    The text search feature in MongoDB (as at 2.6) does not have any built-in features for fuzzy/partial string matching. As you've noted, the use case currently focuses on language & stemming support with basic boolean operators and word/phrase matching.

    There are several possible approaches to consider for fuzzy matching depending on your requirements and how you want to qualify "efficient" (speed, storage, developer time, infrastructure required, etc):

    • Implement support for fuzzy/partial matching in your application logic using some of the readily available soundalike and similarity algorithms. Benefits of this approach include not having to add any extra infrastructure and being able to closely tune matching to your requirements.

      For some more detailed examples, see: Efficient Techniques for Fuzzy and Partial matching in MongoDB.

    • Integrate with an external search tool that provides more advanced search features. This adds some complexity to your deployment and is likely overkill just for typeahead, but you may find other search features you would like to incorporate elsewhere in your application (e.g. "like this", word proximity, faceted search, ..).

      For example see: How to Perform Fuzzy-Matching with Mongo Connector and Elastic Search. Note: ElasticSearch's fuzzy query is based on Levenshtein distance.

    • Use an autocomplete library like Twitter's open source typeahead.js, which includes a suggestion engine and query/caching API. Typeahead is actually complementary to any of the other backend approaches, and its (optional) suggestion engine Bloodhound supports prefetching as well as caching data in local storage.

    0 讨论(0)
提交回复
热议问题