Fast Levenshtein distance in R?

前端 未结 4 1654
野趣味
野趣味 2020-11-30 06:07

Is there a package that contains Levenshtein distance counting function which is implemented as a C or Fortran code? I have many strings to compare and stringMatch

相关标签:
4条回答
  • 2020-11-30 06:21

    And stringdist in the stringdist package does it too, even faster than levenshteinDist under certain conditions (1)

    0 讨论(0)
  • 2020-11-30 06:23

    You could also use levenshtein_distance() from the textTinyR package. I got 'calloc' memory errors with all other packages when it came to larger character vectors of around 30k characters. Only textTinyR worked for me!

    0 讨论(0)
  • 2020-11-30 06:27

    You could try stringDist from Biostrings as well

    0 讨论(0)
  • 2020-11-30 06:31

    levenshteinDist (from the RecordLinkage package) calls compiled C code. Give it a try.

    0 讨论(0)
提交回复
热议问题