GithubHelp home page GithubHelp logo

ocrstyle's Introduction

pageresOCR Style

Optical Character Recognition

背景

本程序最初是想用于发票识别的,目前可以通过ocrengine和imageprocess动态库进行二次开发。

功能

  • 图片识别文字的软件。
  • 支持截图。
  • 支持选择识别的区域。
  • 支持选择识别的语言。
  • 支持倾斜纠正。
  • 支持保存识别的文字。
  • 支持多种语言的识别,内置了中文和英文,可以根据需要下载其它语言。

编译环境

  • ocrstyle 使用的Qt4.8开发的。
  • ocrengine 使用Visual C++ Express 2008开发的。这个工程依赖third-party目录下的Tesseract OCR开发库,目前是google维护的,当时开发的时候官方只有VC2008的开发库,于是就用的VC2008开发了。项目地址:http://code.google.com/p/tesseract-ocr/。
  • imageprocess 使用的Visual C++ Express 2010开发的。这个工程依赖于OpenCV。项目地址:http://opencv.org/
  • buildscripts/ocrstyle.nsi 打包安装包使用的NSIS。

动态库的接口

  • ImageProcessAPI.h 主要功能:图像的加载、灰度处理、倾斜校正、保存等。
  • OCREngine.h 主要功能:配置识别的语言、传入要识别的图像数据、获得文本。

思路

  • 扫描的电子发票一般都有点倾斜,所以要对图片进行倾斜校正。为了方便识别需要进行灰度处理。
  • 一般是要识别发票上的部分内容,所以要想办法把要识别的文本区域的图像数据传给Tesseract OCR,然后获得文本。

截图

主界面

英文图片倾斜纠正并识别

中文发票进行区域选择并识别

ocrstyle's People

Contributors

vwarship avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ocrstyle's Issues

我想识别 银行卡不知道能实现不?

楼主,我是初学者,我想识别 银行卡文字,
请教其中3个工程 ocrstyle、ocrengine、imageprocess如何合并才能生成".exe"文件。还有乍样训练银行卡文字样。
请教楼主说具体一点好吗?

No unzip.h file

ZlibWrapper.h:37: error: unzip.h: No such file or directory

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.