GithubHelp home page GithubHelp logo

tesseract4java / tesseract4java Goto Github PK

View Code? Open in Web Editor NEW
310.0 20.0 91.0 1.88 MB

Java GUI and Tools for Tesseract OCR

License: GNU General Public License v3.0

Java 99.95% Shell 0.04% HTML 0.01%
tesseract-ocr java swing ocr gui

tesseract4java's Introduction

tesseract4java: Tesseract GUI

A graphical user interface for the Tesseract OCR engine. The program has been introduced in the Master’s thesis “Analyses and Heuristics for the Improvement of Optical Character Recognition Results for Fraktur Texts” by Paul Vorbach (German).

Donate with PayPal

Usage

Basic usage is documented on our wiki page

Download

Binary distributions and release notes are available in the releases section.

Screenshots

Preprocessing

Preprocessing view

Box Editor

Box editor for training

Glyph Overview

Glyph overview for easier detection of errors

Comparison View

Comparison view to compare the original document with the perceived result

Transcription View

Evaluation view with a transcription field

ocrevalUAtion

ocrevalUAtion

Batch Export

Batch export functionality to handle large projects

Building and running the software

This software is written in Java and can be built using Apache Maven. In order to build the software you have to follow these steps:

  1. git clone https://github.com/tesseract4java/tesseract4java.git
  2. cd tesseract4java
  3. git submodule init
  4. git submodule update
  5. mvn clean package -Pstandalone. This will include the Tesseract binaries for your platform. You can manually define the platform by providing the option -Djavacpp.platform=[PLATFORM] (available platforms are windows-x86_64, windows-x86, linux-x86_64, linux-x86, and macosx-x86_64).

After you've run through all steps, the directory "gui/target" will contain the file "tesseract4java-[VERSION]-[PLATFORM].jar", which you can run by double-clicking or executing java -jar tesseract4java-[VERSION]-[PLATFORM].jar.

Credits

License

GPLv3

tesseract4java - a graphical user interface for the Tesseract OCR engine
Copyright (C) 2014-2019 Paul Vorbach

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <http://www.gnu.org/licenses/>.

tesseract4java's People

Contributors

celebrate-future avatar dependabot[bot] avatar jlleitschuh avatar pauledwardtang avatar pvorb avatar thadguidry avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tesseract4java's Issues

Page / Images

The 'Page' window displays thumbnail icons for the images in the Project directory.

It maybe better to label the window as Images.

Do not work on MS Windows

I can run it.
I set under General preferences Langdata to "C:\Program Files\Tesseract-OCR\tessdata"
Then when I switch to Box Editor I receive the error: Please select a traineddata file.

tesseract is installed in "C:\Program Files\Tesseract-OCR"

What is missing?

[issue] No preview displayed

The program executes without errors.

schermata del 2017-03-24 11-49-28

However, if I click on preview nothing happens.
Any ideas about what the problem could be?

I'm using Java8 on Linux Mint.

Thanks

Crashes in Box Editor

When opening the box editor while a page is selected, the program crashes

#
# A fatal error has been detected by the Java Runtime Environment:
#
#  EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x0000000068a2fba9, pid=1876, tid=6992
#

Build fail (ocrevalUAtion)

I'm building on Windows 10.0.19044 Build 19044, java 18.0.1.1 2022-04-22, Apache Maven 3.8.6. I've got no experience with any of this. ocrevalUAtion is where it fails. I can provide more information if needed.

Running:

mvn clean package -Pstandalone

Get this error message:

[INFO] Scanning for projects...
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Build Order:
[INFO]
[INFO] tesseract4java [pom]
[INFO] ocrevalUAtion [jar]
[INFO] tools [jar]
[INFO] gui [jar]
[INFO]
[INFO] ------------------< de.vorb.tesseract:tesseract4java >------------------
[INFO] Building tesseract4java 0.3.0-SNAPSHOT [1/4]
[INFO] --------------------------------[ pom ]---------------------------------
[INFO]
[INFO] --- maven-clean-plugin:2.5:clean (default-clean) @ tesseract4java ---
[INFO]
[INFO] ------------------< de.vorb.tesseract:ocrevalUAtion >-------------------
[INFO] Building ocrevalUAtion 0.3.0-SNAPSHOT [2/4]
[INFO] --------------------------------[ jar ]---------------------------------
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary for tesseract4java 0.3.0-SNAPSHOT:
[INFO]
[INFO] tesseract4java ..................................... SUCCESS [ 0.130 s]
[INFO] ocrevalUAtion ...................................... FAILURE [ 0.229 s]
[INFO] tools .............................................. SKIPPED
[INFO] gui ................................................ SKIPPED
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 0.478 s
[INFO] Finished at: 2022-06-23T23:44:14-04:00
[INFO] ------------------------------------------------------------------------
[ERROR] Failed to execute goal on project ocrevalUAtion: Could not resolve dependencies for project de.vorb.tesseract:ocrevalUAtion:jar:0.3.0-SNAPSHOT: Failed to collect dependencies at javax.media.jai:com.springsource.javax.media.jai.core:jar:1.1.3: Failed to read artifact descriptor for javax.media.jai:com.springsource.javax.media.jai.core:jar:1.1.3: Could not transfer artifact javax.media.jai:com.springsource.javax.media.jai.core:pom:1.1.3 from/to maven-default-http-blocker (http://0.0.0.0/): Blocked mirror for repositories: [com.springsource.repository.bundles.external (http://repository.springsource.com/maven/bundles/external, default, releases+snapshots)] -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please read the following articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/DependencyResolutionException
[ERROR]
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR] mvn -rf :ocrevalUAtion

Java Exception occurred

I wish to recognise a minority language from Central Africa. I'm a Linux novice. My system is a fork from Linux Mint 20.1 which has openjdk1.11 and jre1.11. Installation of the 0.2.1 release with Maven, following the instructions at https://github.com/tesseract4java/tesseract4java seemed to go smoothly. However, running the .jar file per the instructions results, after a few seconds, in the report "A Java Exception has occurred." Web searches have shed no light on cause or cure.

How would I fix this? What further information is needed? Where would I find that? (Remember, I'm a novice.)

project gui: Compilation failure

[ERROR] tesseract4java-develop/gui/src/main/java/de/vorb/tesseract/
gui/controller/TesseractTrainer.java:[107,97] 找不到符号

gui build fails

I am building this on macOS (10.15.7), java 10 [version "10" 2018-03-20, SE Runtime Environment 18.3 (build 10+46)]. I have no experience with maven et all. tesseract4java, ocrevalUAtion, and tools build fine. gui fails to build, though.

The messages are:

[ERROR] Failed to execute goal org.apache.maven.plugins:maven-assembly-plugin:2.6:single (assemble-all) on project gui: Execution assemble-all of goal org.apache.maven.plugins:maven-assembly-plugin:2.6:single failed: An API incompatibility was encountered while executing org.apache.maven.plugins:maven-assembly-plugin:2.6:single: java.lang.ExceptionInInitializerError: null

[ERROR] -----------------------------------------------------
[ERROR] realm = plugin>org.apache.maven.plugins:maven-assembly-plugin:2.6
[ERROR] strategy = org.codehaus.plexus.classworlds.strategy.SelfFirstStrategy
[ERROR] urls[0] = file:/Users/y/.m2/repository/org/apache/maven/plugins/maven-assembly-plugin/2.6/maven-assembly-plugin-2.6.jar
[ERROR] urls[1] = file:/Users/y/.m2/repository/org/slf4j/slf4j-jdk14/1.5.6/slf4j-jdk14-1.5.6.jar
[ERROR] urls[2] = file:/Users/y/.m2/repository/org/slf4j/jcl-over-slf4j/1.5.6/jcl-over-slf4j-1.5.6.jar

[ ... 25 more jar urls ... ]

[ERROR] Number of foreign imports: 1 [ERROR] import: Entry[import from realm ClassRealm[maven.api, parent: null]]

The stack frame is
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:215) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148) at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
[ ... ]

Several similar errors are also posted, at:
at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo (DefaultBuildPluginManager.java:186) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:210) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
[ ... ]
and
at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo (DefaultBuildPluginManager.java:184) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:210) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156) at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)

Separate Image OCR and Box file features

Currently the tabs are -

  • Preprocessing
  • Box Editor
  • Symbol Overview
  • Recognition
  • Evaluation

It maybe helpful to keep the tabs related to Image OCR together -

  • Preprocessing
  • Recognition
  • Evaluation

and list the other two related to box file editing later.

Build tesseract4java errors

When I try to build I have the following errors

It seems tesseract4java built successfully. How can I test that I had success? I would like to run a small sample code

[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary for tesseract4java 0.3.0-SNAPSHOT:
[INFO]
[INFO] tesseract4java ..................................... SUCCESS [ 0.577 s]
[INFO] ocrevalUAtion ...................................... SUCCESS [ 39.425 s]
[INFO] tools .............................................. FAILURE [ 2.763 s]
[INFO] gui ................................................ SKIPPED
[INFO] ------------------------------------------------------------------------
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 43.182 s
[INFO] Finished at: 2020-10-14T15:47:04-04:00
[INFO] ------------------------------------------------------------------------
[ERROR] Failed to execute goal on project tools: Could not resolve dependencies for project de.vorb.tesseract:tools:jar:0.3.0-SNAPSHOT: Failed to collect dependencies at org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4: Failed to read artifact descriptor for org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4: Could not transfer artifact org.bytedeco.javacpp-presets:tesseract-platform:pom:4.0.0-1.4.4 from/to central (http://repo1.maven.org/maven2): Transfer failed for http://repo1.maven.org/maven2/org/bytedeco/javacpp-presets/tesseract-platform/4.0.0-1.4.4/tesseract-platform-4.0.0-1.4.4.pom 501 HTTPS Required -> [Help 1]
org.apache.maven.lifecycle.LifecycleExecutionException: Failed to execute goal on project tools: Could not resolve dependencies for project de.vorb.tesseract:tools:jar:0.3.0-SNAPSHOT: Failed to collect dependencies at org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:269)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.apache.maven.project.DependencyResolutionException: Could not resolve dependencies for project de.vorb.tesseract:tools:jar:0.3.0-SNAPSHOT: Failed to collect dependencies at org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:178)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.eclipse.aether.collection.DependencyCollectionException: Failed to collect dependencies at org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.collectDependencies (DefaultDependencyCollector.java:291)
at org.eclipse.aether.internal.impl.DefaultRepositorySystem.collectDependencies (DefaultRepositorySystem.java:284)
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:169)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.eclipse.aether.resolution.ArtifactDescriptorException: Failed to read artifact descriptor for org.bytedeco.javacpp-presets:tesseract-platform:jar:4.0.0-1.4.4
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.loadPom (DefaultArtifactDescriptorReader.java:255)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.readArtifactDescriptor (DefaultArtifactDescriptorReader.java:171)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.resolveCachedArtifactDescriptor (DefaultDependencyCollector.java:541)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.getArtifactDescriptorResult (DefaultDependencyCollector.java:524)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:412)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:365)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.process (DefaultDependencyCollector.java:352)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.collectDependencies (DefaultDependencyCollector.java:254)
at org.eclipse.aether.internal.impl.DefaultRepositorySystem.collectDependencies (DefaultRepositorySystem.java:284)
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:169)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.eclipse.aether.resolution.ArtifactResolutionException: Could not transfer artifact org.bytedeco.javacpp-presets:tesseract-platform:pom:4.0.0-1.4.4 from/to central (http://repo1.maven.org/maven2): Transfer failed for http://repo1.maven.org/maven2/org/bytedeco/javacpp-presets/tesseract-platform/4.0.0-1.4.4/tesseract-platform-4.0.0-1.4.4.pom 501 HTTPS Required
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolve (DefaultArtifactResolver.java:424)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifacts (DefaultArtifactResolver.java:229)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifact (DefaultArtifactResolver.java:207)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.loadPom (DefaultArtifactDescriptorReader.java:240)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.readArtifactDescriptor (DefaultArtifactDescriptorReader.java:171)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.resolveCachedArtifactDescriptor (DefaultDependencyCollector.java:541)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.getArtifactDescriptorResult (DefaultDependencyCollector.java:524)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:412)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:365)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.process (DefaultDependencyCollector.java:352)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.collectDependencies (DefaultDependencyCollector.java:254)
at org.eclipse.aether.internal.impl.DefaultRepositorySystem.collectDependencies (DefaultRepositorySystem.java:284)
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:169)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.eclipse.aether.transfer.ArtifactTransferException: Could not transfer artifact org.bytedeco.javacpp-presets:tesseract-platform:pom:4.0.0-1.4.4 from/to central (http://repo1.maven.org/maven2): Transfer failed for http://repo1.maven.org/maven2/org/bytedeco/javacpp-presets/tesseract-platform/4.0.0-1.4.4/tesseract-platform-4.0.0-1.4.4.pom 501 HTTPS Required
at org.eclipse.aether.connector.basic.ArtifactTransportListener.transferFailed (ArtifactTransportListener.java:52)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector$TaskRunner.run (BasicRepositoryConnector.java:369)
at org.eclipse.aether.util.concurrency.RunnableErrorForwarder$1.run (RunnableErrorForwarder.java:75)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector$DirectExecutor.execute (BasicRepositoryConnector.java:644)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector.get (BasicRepositoryConnector.java:262)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.performDownloads (DefaultArtifactResolver.java:499)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolve (DefaultArtifactResolver.java:401)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifacts (DefaultArtifactResolver.java:229)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifact (DefaultArtifactResolver.java:207)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.loadPom (DefaultArtifactDescriptorReader.java:240)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.readArtifactDescriptor (DefaultArtifactDescriptorReader.java:171)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.resolveCachedArtifactDescriptor (DefaultDependencyCollector.java:541)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.getArtifactDescriptorResult (DefaultDependencyCollector.java:524)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:412)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:365)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.process (DefaultDependencyCollector.java:352)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.collectDependencies (DefaultDependencyCollector.java:254)
at org.eclipse.aether.internal.impl.DefaultRepositorySystem.collectDependencies (DefaultRepositorySystem.java:284)
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:169)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
Caused by: org.apache.maven.wagon.TransferFailedException: Transfer failed for http://repo1.maven.org/maven2/org/bytedeco/javacpp-presets/tesseract-platform/4.0.0-1.4.4/tesseract-platform-4.0.0-1.4.4.pom 501 HTTPS Required
at org.apache.maven.wagon.providers.http.wagon.shared.AbstractHttpClientWagon.fillInputData (AbstractHttpClientWagon.java:1196)
at org.apache.maven.wagon.providers.http.wagon.shared.AbstractHttpClientWagon.fillInputData (AbstractHttpClientWagon.java:1138)
at org.apache.maven.wagon.StreamWagon.getInputStream (StreamWagon.java:126)
at org.apache.maven.wagon.StreamWagon.getIfNewer (StreamWagon.java:88)
at org.apache.maven.wagon.StreamWagon.get (StreamWagon.java:61)
at org.eclipse.aether.transport.wagon.WagonTransporter$GetTaskRunner.run (WagonTransporter.java:567)
at org.eclipse.aether.transport.wagon.WagonTransporter.execute (WagonTransporter.java:435)
at org.eclipse.aether.transport.wagon.WagonTransporter.get (WagonTransporter.java:412)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector$GetTaskRunner.runTask (BasicRepositoryConnector.java:457)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector$TaskRunner.run (BasicRepositoryConnector.java:364)
at org.eclipse.aether.util.concurrency.RunnableErrorForwarder$1.run (RunnableErrorForwarder.java:75)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector$DirectExecutor.execute (BasicRepositoryConnector.java:644)
at org.eclipse.aether.connector.basic.BasicRepositoryConnector.get (BasicRepositoryConnector.java:262)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.performDownloads (DefaultArtifactResolver.java:499)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolve (DefaultArtifactResolver.java:401)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifacts (DefaultArtifactResolver.java:229)
at org.eclipse.aether.internal.impl.DefaultArtifactResolver.resolveArtifact (DefaultArtifactResolver.java:207)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.loadPom (DefaultArtifactDescriptorReader.java:240)
at org.apache.maven.repository.internal.DefaultArtifactDescriptorReader.readArtifactDescriptor (DefaultArtifactDescriptorReader.java:171)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.resolveCachedArtifactDescriptor (DefaultDependencyCollector.java:541)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.getArtifactDescriptorResult (DefaultDependencyCollector.java:524)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:412)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.processDependency (DefaultDependencyCollector.java:365)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.process (DefaultDependencyCollector.java:352)
at org.eclipse.aether.internal.impl.collect.DefaultDependencyCollector.collectDependencies (DefaultDependencyCollector.java:254)
at org.eclipse.aether.internal.impl.DefaultRepositorySystem.collectDependencies (DefaultRepositorySystem.java:284)
at org.apache.maven.project.DefaultProjectDependenciesResolver.resolve (DefaultProjectDependenciesResolver.java:169)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies (LifecycleDependencyResolver.java:243)
at org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies (LifecycleDependencyResolver.java:147)
at org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved (MojoExecutor.java:248)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:202)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:156)
at org.apache.maven.lifecycle.internal.MojoExecutor.execute (MojoExecutor.java:148)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:117)
at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject (LifecycleModuleBuilder.java:81)
at org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build (SingleThreadedBuilder.java:56)
at org.apache.maven.lifecycle.internal.LifecycleStarter.execute (LifecycleStarter.java:128)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
at org.apache.maven.cli.MavenCli.execute (MavenCli.java:957)
at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:289)
at org.apache.maven.cli.MavenCli.main (MavenCli.java:193)
at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke (Method.java:498)
at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced (Launcher.java:282)
at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:225)
at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode (Launcher.java:406)
at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:347)
[ERROR]
[ERROR]
[ERROR] For more information about the errors and possible solutions, please read the following articles:
[ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/DependencyResolutionException
[ERROR]
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR] mvn -rf :tools

MVN?

The last step of the install states to use the command mvn. What is that?

Migrate UI to IntelliJ/JFormDesigner

When I developed this program two years ago, I used Eclipse and WindowBuilder for development.

Today I use IntelliJ IDEA, which unfortunately does not support forms created with WindowBuilder. So in order to keep development going, I need to port all forms to JFormDesigner.

Can't even get jar to execute

C:\Users\Admin>java -jar C:\Users\Admin\Desktop\tesseract4java-0.1.0-windows-x86.jar
Exception in thread "main" java.lang.UnsupportedClassVersionError: de/vorb/tesseract/gui/controller/
TesseractController : Unsupported major.minor version 52.0
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(Unknown Source)
at java.security.SecureClassLoader.defineClass(Unknown Source)
at java.net.URLClassLoader.defineClass(Unknown Source)
at java.net.URLClassLoader.access$100(Unknown Source)
at java.net.URLClassLoader$1.run(Unknown Source)
at java.net.URLClassLoader$1.run(Unknown Source)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(Unknown Source)
at java.lang.ClassLoader.loadClass(Unknown Source)
at sun.misc.Launcher$AppClassLoader.loadClass(Unknown Source)
at java.lang.ClassLoader.loadClass(Unknown Source)
at sun.launcher.LauncherHelper.checkAndLoadMain(Unknown Source)


Java Virtual Machine Launcher

A Java Exception has occurred.

OK

Am I using the wrong version of Java or something?

Make all labels translateable

Currently only a portion of the labels shown in the GUI come from the "labels.properties" file.

All visible labels should be translated from the labels.properties instead. As a second step, Spring's MessageSource could be used to handle various languages.

When there is more than one language, a language choice should be made available in the GUI, but this is not required for milestone 0.2.

open tif

I installed the program without any failure messages And when I open the program and say file: new project and try to open a tif then I see a sandglass but the file doesn't open. What am I doing wrong
image

Support less-common color models

We should support less-common color models like BGR colors or 32-bit depth in order to make the program work with more images out-of-the-box.

Problem loading some png files

While I am able to view and OCR the tifs provided with tesseract for testing (phototest.tif and eurotext.tif) the program does not load the attached png files. These are in devanagari font. I was trying to test whether the overlapping boxes were because of multipage tifs.

I am attaching files.
2016-09-06 1
sanskrit-big

sanskrit-small

thumbnails are created for these files but no other processing seems to happen.

begining with your programm

Hello
I am very interested in your work.
I downloaded Tesseract from the Github repository of the University of Mannheim https://github.com/UB-Mannheim/tesseract
I made a new porject Then I made links between Tesseract and your software (preferences). I gets no results for recognition ("please select a trainnig file). I'm really sorry to bother you but I still insist, because your work represents a very big step forward for me. I search for explanations on your repository but without success.

Thank you for your help.

NO bundled traineddata files

Currently the program includes tesseract binaries but does not include traineddata files.
Ref - #19

https://github.com/tesseract4java/tesseract4java/wiki/Usage says

Before you can start creating new projects, you have to set the environment variable TESSDATA_PREFIX, which points at the directory that contains your "tessdata" directory. Usually this environment variable is set when you install Tesseract.

This requires the user to either install Tesseract or download "tessdata" directory and set the the environment variable TESSDATA_PREFIX.

Installing Tesseract separately can lead to conflict while running program. Asking the user to download "tessdata" is also not straightforward - eg.


Hence I would suggest that at a minimum osd.traineddata, eng.* files and other required tessdata files be included with the program and kept in tessdata subdirectory relative to the binaries for the program.

Secondly, I would suggest that program add the option of downloading tessdata files for selected languages (a few other languages have multiple files in tessdata - eg. ara, hin etc) that user can use.

Both VietOCR and Tesseract from UB Mannaheim include these options - please see

Error opening data file

Language cannot be loaded due to its path being defined as /home/travis/...

java -jar tesseract4java-0.3.0-SNAPSHOT-standalone.jar Error opening data file /home/travis/build/javacpp-presets/tesseract/cppbuild/linux-x86_64/share/tessdata/eng.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory. Failed loading language 'eng' Tesseract couldn't load any languages!

Includes pre-compiled Tesseract (3.04.01) for your platform???

Announcement email says:

Includes pre-compiled Tesseract (3.04.01) for your platform, just click and run it! Traineddata is not included, though.

However, when running the program, edit preferences needs location of tesseract executable.

Also, there is no way to make that path selection blank for the location of tesseract executable to use the bundled binary.

(32bit version running on windows 10)

Can't set tesseract variables

Hello,
I'm trying to use tesseract4java, allowing the personalization of tesseract parameters.
It seems your way to set variables doesn't work...

This is the variables set in the PageRecognitionProducer's constructor:
scrre2

And this is how the variables are inserted in tesseract at the end of the method reset of the same class:
scrre1

Have you any idea of which can be the problem? I'm trying to use a personal dictionary, using tesseract from the terminal all seems to work fine...
Thanks!

This program needs work

Have tried many files - tiff, png, jpg. The program will hang either before showing thumbnail processing, or on hourglass while processing thumbnails. Once I actually got it to display an image ... no OCR however.
Full of bugs. Don't waste your time.

Does Box Editor create box file?

I was under the impression that a box file generated separately was to be given to tsseract4java for viewing and editing.

But it seems that box file is being generated by the program also.

Need clarification on this point.

Allow box file editing via Symbol Overview window

Symbol overview window shows the various symbols for a selected glyph.

Please allow for selection of symbols in the overview window and their deletion from box file.

For example in attached box file a large number of symbols are incorrect, allow for multi-selection and deletion from box file.

image

image

Allow for Font Selection

Recognition tab has a font selection dropdown, the only two choices in it are Antiqua and Fraktur.

Devanagari script is depicted as rectangular boxes with Antiqua. It is displayed with Fraktur, however it is not correct for all letters - there are some overlaps. . See attached images.

image

image

However, when switching to evaluation tab, the font selection is NOT saved and the transcription box shows square boxes. See attached image below.

image

Please allow for a global font selection by user, maybe under options or preferences so that the text can be rendered correctly on all tabs.

Thanks!

Introduce Spring framework

Currently there's a lot of coupling between components. DI via Spring will help to decouple components.

[question] how to run tesseract4java on ubuntu?

Hi, first of all thank you for this program, tesseract really needs a gui. I followed the usage page in the wiki however I still am unable to process my images.

I am on Lubuntu 18.04 and I installed tesseract from the package manager. Then i downloaded the jar for tesseract4java.

I created a new project and can load images, however it seems that the issue is the traineddata file. However through the package manager /synaptic) I installed "tesseract-ocr language files for English" which according to the description "This package contains the data needed for processing images in English language."

Tesseract4java however can't find that on my system I already have such file, and from the program I can't find a way to load it.

Could you help?

I also have a few questions about the traineddata file.

Even though I am on lubuntu, do I have to use such trainedata file? If yes where do I have to put them?

Is there a way to use the files from my software manager?

Thank you :)

Information about training

Hello!

I'd like to try the training feature of tesseract using the tesseract4java GUI.
I saw that there is an option to do it under Tools, but I can't find any information on how to use it (I found only a link to the tesseract wiki, but since I'm a beginner with tesseract it's a bit too much).

Shall someone give me some tips on how to use it?

Does this still work ?

Does this still work ? i literally cant get it to do anything, preview button doesnt do anything, tesseract trainer only results in a null error with nothing written in the trainer log,
image

windows 10 installation issue

When I run the executable jar file tesseract4java-0.1.0-windows-x86_64.jar on my computer, nothing occur. Did I made a mistake? (message: No jnelpt libray in java path)

My config: Windows 10 64 bits java 8.101

Training File / Traineddata / tessdata

The GUI has a window for 'Training File'

It maybe better to label it as 'Traineddata' or tessdata since it displays the available traineddata files.

If traineddata files are available in the same directory as the .jar file, those are displayed.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.