fuzzy matching two strings uring r
问题 I have two vectors, each of which includes a series of strings. For example, V1=c("pen", "document folder", "warn") V2=c("pens", "copy folder", "warning") I need to find which two are matched the best. I directly use levenshtein distance. But it is not good enough. In my case, pen and pens should mean the same. document folder and copy folder are probably the same thing. warn and warning are actually the same. I am trying to use the packages like tm. But I am not very sure which functions are