Question
Is there a package with a Levenshtein distance function implemented in C or Fortran? I have many strings to compare, and stringMatch from MiscPsycho is too slow for this.
Answer 1:
levenshteinDist (from the RecordLinkage package) calls compiled C code. Give it a try.
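
A minimal sketch of calling it, assuming RecordLinkage is already installed from CRAN. The example strings are made up for illustration and are not from the question:

```r
# Assumes install.packages("RecordLinkage") has been run beforehand.
library(RecordLinkage)

# Illustrative inputs (hypothetical, not from the original question).
a <- c("kitten", "flaw", "saturday")
b <- c("sitting", "lawn", "sunday")

# levenshteinDist() is vectorised: element i of a is compared with element i of b.
d <- levenshteinDist(a, b)
print(d)
```

Because the call is vectorised, passing whole character vectors should avoid a slow R-level loop over individual pairs.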
Answer 2:
stringdist in the stringdist package does this too, and is even faster than levenshteinDist under certain conditions.
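
A rough sketch of how this might look, assuming the stringdist package is installed from CRAN. The inputs are hypothetical. Note that the default method of stringdist() is not plain Levenshtein, so method = "lv" is passed explicitly:

```r
# Assumes install.packages("stringdist") has been run beforehand.
library(stringdist)

# Illustrative inputs (hypothetical, not from the original question).
a <- c("kitten", "flaw")
b <- c("sitting", "lawn", "sunday")

# Distance for a single pair; method = "lv" selects Levenshtein distance.
d_pair <- stringdist("kitten", "sitting", method = "lv")

# All pairwise distances: one row per element of a, one column per element of b.
m <- stringdistmatrix(a, b, method = "lv")
print(m)
```

stringdistmatrix() is probably the more convenient entry point when every string has to be compared against every other string.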
Answer 3:
You could try stringDist from Biostrings as well.
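
A minimal sketch, assuming Biostrings has been installed from Bioconductor (it is not on CRAN). The input vector is made up for illustration:

```r
# Assumes Biostrings is installed, e.g. via BiocManager::install("Biostrings").
library(Biostrings)

# Illustrative input (hypothetical, not from the original question).
x <- c("kitten", "sitting", "mitten")

# stringDist() compares all elements of x with each other and returns a "dist" object.
d <- stringDist(x, method = "levenshtein")

# Convert to a full symmetric matrix for easier inspection.
dm <- as.matrix(d)
print(dm)
```

Unlike the pairwise functions in the other answers, this works on a single collection and returns all within-collection distances at once.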
Source: https://stackoverflow.com/questions/3182091/fast-levenshtein-distance-in-r