how to recognize similar words with difference in spelling

前端 未结 8 1855
逝去的感伤
逝去的感伤 2020-12-02 02:05

I want to filter out duplicate customer names from a database. A single customer may have more than one entry to the system with the same name but with little difference in

相关标签:
8条回答
  • 2020-12-02 02:43

    The obvious, established (and well documented) algorithms for finding string similarity are:

    • Levenstein distance
    • Soundex
    0 讨论(0)
  • 2020-12-02 02:46

    Have a look at Soundex

    There is a Soundex function in Transact-SQL (see http://msdn.microsoft.com/en-us/library/ms187384.aspx):

    SELECT 
    SOUNDEX('brook berta'),
    SOUNDEX('Bruck Berta'),
    SOUNDEX('Biruk Berta')
    

    returns the same value B620 for each of the example values

    0 讨论(0)
提交回复
热议问题