I have a problem with running the Entity Ruler on a retokenized doc. What I would like to achieve is to: 1)take a text 2)Merge some tokens based on regex patterns e.g.: Web-