How to find the match between text that typo in Excel?

后端 未结 2 781
终归单人心
终归单人心 2021-01-22 02:04

I have two columns of data with an hundred names on and I need to find the matches.

The problem is when names on the second column are not exactly the same as the first

相关标签:
2条回答
  • 2021-01-22 02:16

    You can use something like John Walkenbach's SOUNDEX() function:

    http://spreadsheetpage.com/index.php/tip/searching_using_soundex_codes/

    Then put the code in Excel:

    http://www.contextures.com/xlvba01.html

    Now, if you had this data:

    A         | B        
    Setyadi   | Setiadi  
    Tak Jelan | Tak Lejan
    

    Now you want to add a formula in C1 like:

    =SOUNDEX(A1)

    And copy that formula to C2, D1, and D2.

    Now take a look at your data:

    A         | B         | SOUNDEX(ColumnA) | SOUNDEX(ColumnB)
    Setyadi   | Setiadi   | S330             | S330
    Tak Jelan | Tak Lejan | T245             | T242
    

    Notice how Setyadi and Setiadi are exactly the same, that's because they sound the same, which is why the code from the SOUNDEX function comes back like this.

    Now when you look at the Tak Jelan entry, you see that there is a difference of 3 (from T245 to T242). Now, what I would do is creat a new formula where if the first letter is the same, then pull out only the number and compare how close they are like:

    =IF(LEFT(C1,1)=LEFT(D1,1),STDEV.P(MID(C1,2,3),MID(D1,2,3)))

    Then you can compare the std deviation.

    0 讨论(0)
  • 2021-01-22 02:27

    Soundex was originally developed for Census data. A major problem with the original basic function is it ignores vowels and only checks a certain number of characters. For instance, it will usually give a match for: Renkin, Rankin, Rincon, Reinckens (my surname), Renkens, Rincones, Rinkins, because they all have R-N-K-N sounds and the original only compares the first 4 consonants.

    0 讨论(0)
提交回复
热议问题