romansorin / pdf-to-text Goto Github PK
View Code? Open in Web Editor NEWConverts a batch of PDF files to text, with optional keyword matching to move matches into a separate directory using the Tesseract OCR and pdf2image packages.
License: MIT License