Comments (2)
Can you elaborate? When recognizing in plain text mode, the text from each recognized image is separated with a line break. If you recognize to hOCR, the output is split into separate pages. Not sure where you get one continuous text block with the text of all recognized images?
from gimagereader.
Hello manisandro !
Thanks for the message.
From practical experience it has been time to convert an image file with the text it has in text form (with OCR) the lines in order - it is fine. Only I mean when many files it is not in order. There is a missing line break between the files of the image file.
The following is a simplified explanation based on an example:
Image-File 1: The melting Arctic is a crime scene.
Image-File 2: J7 is the anonymous perpetrator leaving evidence and clues for me to discover,
Image-File 3: like breadcrumbs leading back to him. James, he had said,
Image File 4: the day we first met at the research institute,
Image File 5: "If you are going to make it up here, don’t lock your doors."
Image File 6: It seemed like a life philosophy, rather than a survival tip.
This was converted without a line break and looks like this:
The melting Arctic is a crime scene.
J7 is the anonymous perpetrator leaving evidence and clues for me to discover,
like breadcrumbs leading back to him. James, he had said,
the day we first met at the research institute,
“If you are going to make it up here, don’t lock your doors.”
It seemed like a life philosophy, rather than a survival tip.
Actually, it should look like this with a line break (this mean automatically make an enter paragraph):
The melting Arctic is a crime scene.
J7 is the anonymous perpetrator leaving evidence and clues for me to discover,
like breadcrumbs leading back to him. James, he had said,
the day we first met at the research institute,
“If you are going to make it up here, don’t lock your doors.”
It seemed like a life philosophy, rather than a survival tip.
from gimagereader.
Related Issues (20)
- No option to download openSUSE package in OBS GUI. HOT 2
- Incompatible with Podofo 0.10.1 ... HOT 1
- [Bug] The program fails to install certain dictionaries HOT 5
- [Bug] The program crashes when the OCR language is not english HOT 6
- No executables (released/compiled on Nov/7/2023) on Windows [10] are working. HOT 3
- There is a virus warning in the latest version HOT 1
- Feature request: localized inverted colors.
- Feature request: Sync Scroll or cursor sync on line HOT 1
- Latin script or Trained data makes it crash each time. HOT 2
- gimagereader-gtk: symbol lookup error HOT 2
- Not sure how to proceed with the workflow? HOT 4
- Remove linebreaks within paragraphs on export to ODT / Apply font size in HTML HOT 3
- Segfault on Alpine (OpenCL, Tesseract issue?) HOT 3
- Detection rate of 3.4.x inferior compared to previous version 3.3.1 HOT 1
- Compiling latest version in ubuntu HOT 7
- Qt version does not offer "JPG" for open, only "jpg" HOT 1
- Private API enchant-provider.h, no more available since enchant 2.7 HOT 4
- Cooperation with .uzn file
- poor performance compared to raw tesseract HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gimagereader.