Comments (4)
I think building a font-specific OCR system would be really useful, but
it's a bit outside of what Ocrad.js is meant to do. While working on
Naptha, I've played with building my own text recognition system which was
dependent on knowledge of the font a priori (another interesting problem to
solve is to figure out what font some text is written in before recognizing
its letters, which serves as the basis of an interesting chicken-and-egg
problem).
I'd really love to see something like this exist, and I'm not sure I'll
have the time to make something like this (and I'm not convinced I know
enough about OCR to do it well). But I too would love to see something like
this exist.
On Sat, May 3, 2014 at 5:33 AM, adam80 [email protected] wrote:
THis is a great tool and I have been playing around with it for the last
couple of days. Is there anyway to use an specific font for the basis of
OCR, say Calibri? This means that if the user know what the base font of
the text they are scanning is there would be a higher chance of conversion?
That is what I am hoping for.Please let me know of there is a way to do this if you can.
Thanks!
—
Reply to this email directly or view it on GitHubhttps://github.com//issues/8
.
from ocrad.js.
Is there any library you would suggest that can be configured for a particular font or be trained in some way? I'm looking for a way to recognize scoreboard data, ideally in client-side JS, and am curious where to begin if not this library. Thank you!
from ocrad.js.
from ocrad.js.
@antimatter15 Thanks, yes I've tried it but it's mis-identifying the numbers since they do look quite a bit like other characters. 8
, for example, appears often as a B
. If I could re-train the analysis, I think there'd be a lot more accurate detection.
I'm just getting started with this, so I'm not sure how to make my own dataset for Tesseract, but I'll continue looking into it. Thank you for your advice!
from ocrad.js.
Related Issues (20)
- How to change character set to iso-8859-9 is not documented HOT 1
- wrong reconize
- Uncaught DOMException: Failed to execute 'getImageData' on 'CanvasRenderingContext2D': The source width is 0.
- ability to 'cut' ie apply target rectangle to input
- Zonal OCR HOT 1
- Recognizing 8s
- don't work with ssl url
- dident find worker file
- Won't work when uploading image with bigger file size HOT 1
- Ocrad Node not recognize .jpg
- Can not get the nodejs example to work... HOT 1
- How to recognize Arabic font
- Invalid Number HOT 1
- Examples broken with last nodeJS canvas versions
- How do i run ocrad.js for development?
- When I build by build.sh got these errors, could you help me ? thank you ! HOT 7
- can not support Chinese? HOT 1
- much faster than tesseract.js HOT 1
- Chinese support in Ocrad instead of Tesseract
- browser example does not work HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ocrad.js.