At the moment I'm looking into doing some PDF merging with pyPdf, but sometimes the inputs are not in the right order, so I'm looking into scraping each page for its page
Another option is PyMuPDF: https://pymupdf.readthedocs.io/en/latest/tutorial.html
import fitz  # PyMuPDF
doc = fitz.open('Path To File')
doc.page_count  # number of pages; older PyMuPDF releases spell this doc.pageCount
pip install pymupdf
For large documents I was getting a recursion error when using PyPDF2, so this was another quick and simple way to get the page count.
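Since the question is about merging, here is a rough sketch of how the same library could be used to count pages and to concatenate files in a chosen order. The function names count_pages and merge_pdfs and the open_pdf parameter are made up for illustration (open_pdf only exists so the functions can be tried without real files). It assumes a recent PyMuPDF where the attributes are spelled page_count and insert_pdf; older releases used pageCount and insertPDF.

```python
def count_pages(path, open_pdf=None):
    """Return the number of pages in the PDF at `path`."""
    if open_pdf is None:
        import fitz  # PyMuPDF, installed with `pip install pymupdf`
        open_pdf = fitz.open
    # Documents work as context managers, so the file is closed afterwards.
    with open_pdf(path) as doc:
        return doc.page_count


def merge_pdfs(paths, out_path, open_pdf=None):
    """Append the PDFs in `paths`, in the order given, and save to `out_path`.

    Returns the page count of the merged document.
    """
    if open_pdf is None:
        import fitz  # PyMuPDF
        open_pdf = fitz.open
    merged = open_pdf()  # called with no argument: a new, empty PDF
    for path in paths:
        with open_pdf(path) as src:
            merged.insert_pdf(src)  # append every page of src
    total = merged.page_count
    merged.save(out_path)
    merged.close()
    return total
```

Usage would be something like merge_pdfs(['part1.pdf', 'part2.pdf'], 'merged.pdf'), with the list already sorted into the order you want; those file names are placeholders.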