Comments (1)
Sorry. After I have a look at hOCR.html using tesseract 3.02.02 command, I
understand why.
With spaces between two characters, hOCR shows that sometimes it is regarded as
separator, sometimes as spaces, sometimes as an empty word. So it is very hard
to know which word corresponds to which line and which boundingbox.
It seems it is better for the tesseract-android-tool to use an api for output,
so that we could know each line contains what words, and each word corresponds
to each confidence values and boundingbox.
ps. I apologize I made a mistake by claiming it should have no "-" outputs. I
also trained "-", and forgot to exclude it.
Thanks.
Original comment by [email protected]
on 16 Nov 2012 at 8:08
from tesseract-android-tools.
Related Issues (20)
- PSM constants are incorrect HOT 1
- Add tests for Bitmap <=> Pix conversion
- Error in loading eng.traineddata (3.01) using com.android.content.res.AssetManager() HOT 6
- Getting Fatal signal 11 error while capture line of numbers HOT 1
- Update to Tesseract 3.02 HOT 1
- Missing files for native build HOT 1
- Binarize Image in tesseract
- How do I use tesseract-andriod-tools?
- WriteFile don't work as expected
- ndk-build fail allheaders.h missing HOT 4
- ndk-build failure due to unsafe fprintf usage in tesseract source. HOT 1
- Assert failed when using tesseract with PSM_OSD_ONLY flag
- nativeSetImage not found
- ChoiceIterator
- Getting result words array
- javax.imageio.IIOException: ICC APP2 encountered without prior JFIF in Tesseract OCR
- root galaxy centra sch-7
- how to Image binarization
- error on compiling tessaract-android-tools ndk-build fails
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesseract-android-tools.