I am using pytesseract to extract text from an invoice document, and I am able to extract the information. Is there any function in pytesseract, or any further approach, to get thi