GithubHelp home page GithubHelp logo

kkpan11 / tessdoc Goto Github PK

View Code? Open in Web Editor NEW

This project forked from tesseract-ocr/tessdoc

0.0 1.0 0.0 4.52 MB

Tesseract documentation

Home Page: https://tesseract-ocr.github.io/tessdoc/

HTML 100.00%

tessdoc's Introduction

Tesseract User Manual

This user manual is for Tesseract versions 5.x. For versions 4.x.x, 3.05.02 and older, see the documentation for old versions.

Introduction

Tesseract is an open source text recognition (OCR) Engine, available under the Apache 2.0 license.

Tesseract can be used directly via command line, or (for programmers) by using an API to extract printed text from images. It supports a wide variety of languages. Tesseract doesn't have a built-in GUI, but there are several available from the 3rdParty page. External tools, wrappers and training projects for Tesseract are listed under AddOns.

Tesseract can be used in your own project, under the terms of the Apache License 2.0. It has a fully featured API, and can be compiled for a variety of targets including Android and the iPhone. See the 3rdParty and AddOns pages for samples of what has been done with it.

If you have a question, first read the documentation, particularly the FAQ to see if your problem is addressed there. If not, search the Issues List, Tesseract user forum, and if you still can't find what you need, please ask your question in Tesseract user forum Google group.

Tesseract is free software, so if you want to pitch in and help, please do! If you find a bug and fix it yourself, the best thing to do is to attach the patch to your bug report in the Issues List.

Releases and Changelog

Tesseract with LSTM

Tesseract 4.0 added a new OCR engine based on LSTM neural networks. It works well on x86/Linux with official Language Model data available for 100+ languages and 35+ scripts. See 4.0x-Changelog for more details.

5.x.x

Source Code

Tesseract 5.x.x source code is available in the main branch of the repository. The main branch is using 5.0.0 semver versioning because C++ code modernization caused API incompatibility with 4.x release.

Binaries

Binaries are available from:

Traineddata Files

For detailed information about the different types of models, see Data Files.

Model files for version 4.00 are available from tessdata tagged 4.00. It has models from November 2016. The individual language file links are available from the following link.

Model files for version 4.0.0 and later are available from tessdata tagged 4.0.0. It has legacy models from September 2017 that have been updated with Integer versions of tessdata_best LSTM models. This set of traineddata files has support for both the legacy recognizer with --oem 0 and for LSTM models with --oem 1. These models are available from the following Github repo.

Two more sets of official traineddata, trained at Google, are made available in the following Github repos. These do not have the legacy models and only have LSTM models usable with --oem 1.

Language model traineddata files same as listed above for version 4.0.0 can be used with Tesseract 5.x.x. These are available from:

Compiling and Installation

Usage

API Examples

Technical Information

Training for Tesseract 5

Training with tesstrain.sh (a.k.a tesseract 4 training) in unsupported/abandoned. Please use scripts from tesseract-ocr/tesstrain for training.

Testing

External Projects

User Manual for Old Versions

tessdoc's People

Contributors

amitdo avatar shreeshrii avatar stweil avatar zdenop avatar egorpugin avatar gerhobbelt avatar tfmorris avatar jbreiden avatar theraysmith avatar jbarlow83 avatar electricrcaircraftguy avatar ic avatar alexcohn avatar cxcorp avatar thadguidry avatar scott0123 avatar floriankoerner avatar jimregan avatar solomennikm avatar ajinkya933 avatar mrocr avatar williamtambellini avatar stefre avatar bertsky avatar metaforte avatar dan9er avatar akulavinaykumar avatar nickjwhite avatar johnthagen avatar jeanmonet avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.