An open API service indexing awesome lists of open source software.

https://github.com/tesseract4java/tesseract4java

Java GUI and Tools for Tesseract OCR
https://github.com/tesseract4java/tesseract4java

gui java ocr swing tesseract-ocr

Last synced: 4 days ago
JSON representation

Java GUI and Tools for Tesseract OCR

Awesome Lists containing this project

README

          

# tesseract4java: Tesseract GUI

A graphical user interface for the [Tesseract OCR engine][tesseract]. The program has been introduced in the [Master’s
thesis “Analyses and Heuristics for the Improvement of Optical Character Recognition Results for Fraktur Texts”][thesis]
by Paul Vorbach (German).

[tesseract]: https://github.com/tesseract-ocr/tesseract
[thesis]: http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-106527

[![Donate with PayPal](https://www.paypalobjects.com/en_US/i/btn/btn_donate_SM.gif)](https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=LF8T2JP2YUUUE)

## Usage
Basic usage is documented on [our wiki page](https://github.com/tesseract4java/tesseract4java/wiki/Usage)

## Download

Binary distributions and release notes are available in the [releases section].

[Releases section]: https://github.com/tesseract4java/tesseract4java/releases

## Screenshots

![Preprocessing](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-preprocessing.png)

Preprocessing view

![Box Editor](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-box-editor.png)

Box editor for training

![Glyph Overview](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-glyph-overview.png)

Glyph overview for easier detection of errors

![Comparison View](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-comparison.png)

Comparison view to compare the original document with the perceived result

![Transcription View](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-evaluation.png)

Evaluation view with a transcription field

![ocrevalUAtion](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/ocrevaluation.png)

[ocrevalUAtion]

![Batch Export](https://raw.githubusercontent.com/tesseract4java/tesseract-gui/master/screenshots/gui-batch-export.png)

Batch export functionality to handle large projects

## Building and running the software

This software is written in Java and can be built using [Apache Maven]. In order to build the software you have to
follow these steps:

1. `git clone https://github.com/tesseract4java/tesseract4java.git`
2. `cd tesseract4java`
3. `git submodule init`
4. `git submodule update`
5. `mvn clean package -Pstandalone`. This will include the Tesseract binaries for your platform. You can manually
define the platform by providing the option `-Djavacpp.platform=[PLATFORM]` (available platforms are
`windows-x86_64`, `windows-x86`, `linux-x86_64`, `linux-x86`, and `macosx-x86_64`).

After you've run through all steps, the directory "gui/target" will contain the file
"tesseract4java-[VERSION]-[PLATFORM].jar", which you can run by double-clicking or executing
`java -jar tesseract4java-[VERSION]-[PLATFORM].jar`.

[Apache Maven]: https://maven.apache.org/

## Credits

- This software uses the [Tesseract OCR engine][tesseract] ([APLv2.0]).
- This software uses [ocrevalUAtion] by Rafael C. Carrasco for providing
accuracy measures of the OCR results ([GPLv3]).
- This software uses the [Silk icon set][silk] by Mark James
([famfamfam.com]) ([CC-BY-3.0]).

[APLv2.0]: http://www.apache.org/licenses/LICENSE-2.0
[GPLv3]: https://www.gnu.org/licenses/gpl-3.0.html
[ocrevalUAtion]: https://github.com/impactcentre/ocrevalUAtion
[silk]: http://www.famfamfam.com/lab/icons/silk/
[famfamfam.com]: http://www.famfamfam.com/
[CC-BY-3.0]: http://creativecommons.org/licenses/by/3.0/

## License

GPLv3

~~~
tesseract4java - a graphical user interface for the Tesseract OCR engine
Copyright (C) 2014-2019 Paul Vorbach

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program. If not, see .
~~~