Hamming Distance vs. Levenshtein Distance

后端 未结 2 1495
失恋的感觉
失恋的感觉 2021-02-01 12:33

For the problem I\'m working on, finding distances between two sequences to determine their similarity, sequence order is very important. However, the sequences that I have are

相关标签:
2条回答
  • 2021-02-01 13:12

    That question really depends on the types of sequences you are matching, and what result you want.

    If it's not a problem that "1234567890" and "0123456789" are considered totally different, indeed Hamming distance is fine.

    0 讨论(0)
  • 2021-02-01 13:30

    In addition to the right Johan answer, the padding can be problematic.

    For example, when you compare 123 to 123456 it's different if you pad either at the end of the string or at the start of the string. The similarity of ___123 with 123456 is 0, but The similarity of 123___ with 123456 is 3.

    0 讨论(0)
提交回复
热议问题