amit2014 / pdf-extractor Goto Github PK
View Code? Open in Web Editor NEWPDF Extractor, a powerful Python application that simplifies the extraction of highlighted text from PDF files.
License: MIT License
PDF Extractor, a powerful Python application that simplifies the extraction of highlighted text from PDF files.
License: MIT License
# PDF Extractor The PDF Extractor is a Python application that extracts highlighted text from PDF files using the PyMuPDF library. It provides a user-friendly graphical interface for selecting a PDF file and displaying the extracted information. ## Features - Extracts highlighted text from PDF files - Supports two different date formats for effective and expiry dates - Displays the extracted information in a formatted output - Provides a graphical interface for easy interaction ## Prerequisites - Python 3.x - PyMuPDF library (`pip install pymupdf`) - Tkinter library (included with most Python installations) ## Getting Started 1. Clone the repository or download the source code. 2. Install the required dependencies by running `pip install -r requirements.txt`. 3. Run the `pdf_extractor.py` file using Python: `python pdf_extractor.py`. 4. The application will launch, and a file dialog will prompt you to select a PDF file. 5. Select a PDF file that contains highlighted text. 6. The application will extract the highlighted text and display it in a graphical interface. 7. The extracted information will be shown in a formatted output, including the name of the insured, policy number, effective date, and expiry date. 8. Close the application when you're done. ![GUI Screenshot -1](https://raw.githubusercontent.com/amit2014/PDF-Extractor/master/example/1.png) ![GUI Screenshot -2](https://raw.githubusercontent.com/amit2014/PDF-Extractor/master/example/2.png) ![GUI Screenshot -3](https://raw.githubusercontent.com/amit2014/PDF-Extractor/master/example/3.png) ## License This project is licensed under the [MIT License](LICENSE). ## Acknowledgements - PyMuPDF: https://pymupdf.readthedocs.io/ - Tkinter: [Python Software Foundation](https://docs.python.org/3/library/tkinter.html) ## Author Amit Jadhav
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.