Say I have a thousand letters and I want to extract only the "body" text for Natural language processing, how do I programmatically extract only the body text from