I have several hundred documents of multiple pages with text which I extract from the web. I am trying to extract all country names which appear in the text plus e.g. 10 words b