GithubHelp home page GithubHelp logo

shafeeqahameds / automated-students-details-extraction-from-documents-using-yolov8-and-tesseract-ocr Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 1.23 MB

This project focuses on automating the extraction of details such as Name, Board of Education, and more from scanned documents. By leveraging Tesseract OCR, an Optical Character Recognition (OCR) tool, and implementing various computer vision techniques, we have developed an effective solution for this task.

License: MIT License

Python 6.35% Jupyter Notebook 89.55% HTML 4.11%

automated-students-details-extraction-from-documents-using-yolov8-and-tesseract-ocr's Introduction

AUTOMATED STUDENTS DETAILS EXTRACTION FROM DOCUMENTS USING YOLOv8 AND TESSERACT OCR

This project focuses on automating the extraction of details such as Name, Board of Education, and more from scanned marksheet. By leveraging Tesseract OCR, an Optical Character Recognition (OCR) tool, and implementing various computer vision techniques, we have developed an effective solution for this task.

About

In the evolving digital landscape, this project pioneers a transformative leap in text recognition. By integrating YOLOv8 and Tesseract, it efficiently bridges the divide between printed content and digital data utilization. With a focus on streamlining academic record extraction, the project's core objective is to cultivate a robust system, seamlessly identifying and processing crucial information. By automating document verification, it minimizes human error, elevating operational efficiency and marking a significant advancement in the domain of document validation.

Features

  1. Cutting-edge Integration: Harnesses the power of YOLOv8 and Tesseract for state-of-the-art object detection and optical character recognition.
  2. Seamless Automation: Effortlessly identifies and processes crucial information from documents, ensuring accuracy and operational efficiency.
  3. Mitigation of Human Error: Automates document verification to reduce the potential for human error in data extraction.
  4. Transparency and Efficiency: Represents a forward-thinking approach to document validation, enhancing operational efficiency and instilling transparency in administrative processes.

Project Flow

image

User Flow Diagram

image

Training Model

YOLOv8 Object Detection Model Training

Welcome to the training model page of our project. In this document, we'll provide you with an overview of the YOLOv8 object detection model, explain the dataset, and guide you through the training process.

YOLOv8 Overview

YOLOv8 (You Only Look Once version 8) is a state-of-the-art object detection algorithm. It provides real-time and accurate object detection by dividing an image into a grid and making predictions for objects within each grid cell. YOLOv8 is a versatile framework, and in this project, we've trained a custom YOLOv8 model for the specific task of detecting "Name", "Board of Education" and "STANDARD" within marksheets.

Dataset

Our dataset consists of nearly 500+ images, with "Name", "Board of Education" and "STANDARD" annotated in YOLOv8 format. The dataset has been pre-processed and augmented to enhance the diversity and quality of the training data. Augmentation techniques include horizontal and vertical flips, rotations, random cropping, brightness adjustments, exposure adjustments, Gaussian blur, and noise.

Training Process

To train our custom YOLOv8 model, we used the processed dataset and a specific model configuration. Training involved optimizing the model's parameters to accurately detect "Name", "Board of Education" and "STANDARD" within marksheets. We performed training for a specified number of epochs and evaluated the model's performance on validation data.

Model Configuration

Our YOLOv8 model is configured with specific hyperparameters, architecture details, and object detection settings. The model configuration file and weights are available in this repository.

Results and Evaluation

After training the model, we conducted an evaluation to assess its performance. We used various metrics to measure the accuracy, precision, recall, and F1-score of our object detection model.

val_batch0_pred results

License

This project is licensed under the MIT License.

automated-students-details-extraction-from-documents-using-yolov8-and-tesseract-ocr's People

Contributors

shafeeqahameds avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.