I am trying to extract text from a PDF file using Python. My main goal is I am trying to create a program that reads a bank statement and extracts its text to update an exce
PyPDF2 does not read whole pdf correctly. You must use this code.
import pdftotext pdfFileObj = open("January2019.pdf", 'rb') pdf = pdftotext.PDF(pdfFileObj) # Iterate over all the pages for page in pdf: print(page)