I started with a few dozen pdf files and I\'ve extracted the text in each one by looping through the following:
import pdfplumber def get_text(file): with