What techniques/tools are there for discovering common phrases in chunks of text?

后端 未结 3 424
我寻月下人不归
我寻月下人不归 2021-01-02 17:28

Lets say I have 100000 email bodies and 2000 of them contains an abitrary common string like \"the quick brown fox jumps over the lazy dog\" or \"lorem ipsum dolor sit amet\

3条回答
  •  醉梦人生
    2021-01-02 18:12

    I'm not sure if this what you want but check out longest common substring problem and diff utility algorithms.

提交回复
热议问题