Comments (4)
Yes this would definitely be a nice feature. Need to think about how to best do the GUI integration though. I suppose something like
- add a combobox next to the OCR button allowing the user to choose between text output and overlay output
- if text output is chosen, behavior is as it is currently
- if overlay output is chosen, instead of a text field in the output pane, display a pdf-viewer style widget, displaying, for each recognized source image, the unmodified image overlayed with text fields containing the recognized text. Though should it still be allowed to recognize only portions of an image, and if yes, how to detect whether the same portion has already been recognized? An approach would be, every time a certain page is recognized, to add it as a separate page to the output pane, regardless of whether such a page already exists.
Thoughts welcome.
from gimagereader.
I think this is linked to the ideas in issue #30. The hOCR output could be used to create a search-able pdf. There is already a program available (hocr2pdf) that does the job.
from gimagereader.
As noted in issue #30, an initial implementation is now committed, but some work remains to be done.
from gimagereader.
A first implementation should be pretty usable now.
from gimagereader.
Related Issues (20)
- Please merge complete Hungarian (magyar) translation from Weblate HOT 3
- Cannot build against pododo 0.10.0 HOT 4
- Crash (without warning) when reading certain sequences of characters HOT 7
- Allow deletion of multi-selection
- Inconsistent results HOT 9
- gImageReader (3.4.1) crashes when Santali tessdata is used HOT 1
- Feature Request: ctrl-v to quickly recognize image from clipboard
- Complete Arabic Translation for gImageReader UI HOT 2
- Entrypoint Problem with DockerFile
- Arch Linux link is dead HOT 1
- Setting to ignore pictures HOT 2
- problem with export to ods HOT 3
- Unclear instructions on how to adjust Brightness / Contrast of multiple sources HOT 1
- install instructions for rocky linux 9? HOT 1
- feature request: table of contents HOT 1
- Failure to download and install dictionary HOT 6
- Failure with the dictionary of the Polish language HOT 3
- Does not detect white text on a blue/purple background HOT 1
- The application unnecessarily automatically rotates the image HOT 3
- No option to download openSUSE package in OBS GUI. HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gimagereader.