I am working on using spacy for some NLP tasks, such as calculating entity frequency and PMI scores (relationship ranking between organization entities and lemmas). My corpus of