I want to preprocess image files of forms and receipts. But how can I identify the candidate images that require preprocessing, and perform OCR directl