harshablaze / printed-text-to-speech-using-ocr-and-spell-correction

Printed-text-to-speech conversion that improves OCR accuracy using spell correction

Python 100.00%
opencv2 tesseract-ocr spelling-correction image-to-speech roi-segmentation image-processing detect-names-in-spell-correction

printed-text-to-speech-using-ocr-and-spell-correction's Introduction

NOTE: I developed this project on Windows 10 (64-bit) with a 64-bit Python 3.9.2 environment.
To run this project on Linux, change the file paths and the Tesseract path in ocr.py.
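
The Linux change described in the note can be sketched roughly as follows. This is only an illustration, not the project's actual code: the helper name default_tesseract_cmd and the Windows install path are assumptions, and the commented-out assignment assumes ocr.py uses the pytesseract package.

```python
import platform
import shutil

# Assumed default install location on Windows; adjust to your machine.
WINDOWS_TESSERACT = r"C:\Program Files\Tesseract-OCR\tesseract.exe"


def default_tesseract_cmd(system=None):
    """Return a plausible Tesseract executable path for the given OS name."""
    system = system or platform.system()
    if system == "Windows":
        return WINDOWS_TESSERACT
    # On Linux/macOS, look the binary up on PATH, falling back to the bare name.
    return shutil.which("tesseract") or "tesseract"


if __name__ == "__main__":
    print(default_tesseract_cmd())
    # If ocr.py uses pytesseract (an assumption), the path would be applied as:
    # pytesseract.pytesseract.tesseract_cmd = default_tesseract_cmd()
```

Choosing the path at runtime keeps a single copy of the script usable on both systems, instead of editing a hard-coded Windows path by hand.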


Setting up the project

1. After downloading the project, change into the project folder

  • src>cd 4-4-project

2. Activate the env

  • src\4-4-project>.\env\Scripts\activate

NOTE: If the env is activated, (env) appears before your src path, and you can go straight to step 4. If the env failed to activate, follow step 3.
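
Besides looking for the (env) prefix, activation can also be checked from inside Python. The snippet below is a minimal sketch that assumes the env folder was created with venv or a recent virtualenv; the function name in_virtualenv is made up for this example.

```python
import sys


def in_virtualenv():
    """True when the interpreter runs inside a venv-style virtual environment."""
    # Inside a venv, sys.prefix points at the env folder while
    # sys.base_prefix points at the base Python installation.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    print("env active" if in_virtualenv() else "env not active")
```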

3. Install the packages from requirements.txt

  • src\4-4-project>pip install -r requirements.txt

Run the project

4. Start the project

  • src\4-4-project>python start.py


Once start.py is running, acquireimage runs automatically.
Press the spacebar to capture an image, or press Esc to enter an image number manually.
For manual input, first save the image in the images folder under the name sample45.jpg (any number can be used instead of 45).
If you pressed Esc while the image was being acquired, you are asked for the image number, so enter 45 or whichever number you used.
You can modify the code to take manual input images from your own custom folder.
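
The key handling and file naming described above could look roughly like this. It is a hedged sketch rather than the project's own code: the function names are hypothetical, the key codes 32 and 27 are the ASCII values for spacebar and Esc (as typically read from cv2.waitKey(...) & 0xFF, assuming the capture loop uses OpenCV), and the folder parameter shows one way to point at a custom folder.

```python
import os

SPACE_KEY = 32  # ASCII code for the spacebar
ESC_KEY = 27    # ASCII code for Escape


def action_for_key(key_code):
    """Map a key code to 'capture', 'manual', or None (keep waiting)."""
    if key_code == SPACE_KEY:
        return "capture"
    if key_code == ESC_KEY:
        return "manual"
    return None


def sample_image_path(number, folder="images"):
    """Build the path of a manually saved image, e.g. sample45.jpg in images."""
    return os.path.join(folder, f"sample{int(number)}.jpg")


if __name__ == "__main__":
    print(action_for_key(ESC_KEY))
    print(sample_image_path("45"))
    # Hypothetical custom folder instead of the default images folder.
    print(sample_image_path(45, folder="my_images"))
```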

During execution, some images may pop up to show the internal processing steps. Press Ctrl+W or Esc to proceed.


Troubleshooting errors

1. If you encounter errors while installing requirements.txt

  • Check the Python architecture:
src\4-4-project>python
>>> import platform
>>> platform.architecture()

output: ('64bit', 'WindowsPE')
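
The same check can be run as a small script instead of typing it into the interpreter. This is a sketch only; interpreter_bits is a hypothetical helper name, and it reports the pointer size of the running interpreter.

```python
import platform
import struct


def interpreter_bits():
    """Return the pointer size of the running interpreter in bits (32 or 64)."""
    return struct.calcsize("P") * 8


if __name__ == "__main__":
    print(platform.architecture())
    print(f"{interpreter_bits()}-bit Python {platform.python_version()}")
```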

2. If you encounter errors after running start.py

  • Check whether it is a package error, an input file error, or a path error, and fix it manually
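
One way to narrow down which kind of error it is: check that the packages can be found and that the expected files and folders exist. The sketch below makes assumptions: the module names cv2 and pytesseract are guessed from the project's topics, the paths are the ones mentioned in this README, and both helper names are hypothetical.

```python
import importlib.util
import os


def missing_packages(names):
    """Return the top-level module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


def missing_paths(paths):
    """Return the files or folders that do not exist."""
    return [p for p in paths if not os.path.exists(p)]


if __name__ == "__main__":
    # Assumed module names and paths; adjust to match the project.
    print("missing packages:", missing_packages(["cv2", "pytesseract"]))
    print("missing paths:", missing_paths(["images", "ocr.py", "start.py"]))
```

If both lists come back empty, the problem is more likely in the input image itself or in a hard-coded path inside the scripts.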

For any issues, contact me through LinkedIn.

printed-text-to-speech-using-ocr-and-spell-correction's People

Contributors

harshablaze, trellixvulnteam

