An open API service indexing awesome lists of open source software.

https://github.com/milahu/hocr-editor-qt

graphical HOCR editor to produce minimal diffs for proofreading of tesseract OCR output
https://github.com/milahu/hocr-editor-qt

cst-editor hocr hocr-editor minimal-diff ocr-post-processing ocr-postprocessing ocr-proofreading proofreading tesseract tesseract-ocr

Last synced: 4 months ago
JSON representation

graphical HOCR editor to produce minimal diffs for proofreading of tesseract OCR output

Awesome Lists containing this project

README

          

# hocr-editor-qt

graphical HOCR editor to produce minimal diffs for proofreading of tesseract OCR output

## usage

```
python hocr-editor.py test/data/mit-license-template/mit-license-template.hocr
```

## screenshot

![hocr-editor-qt screenshot](doc/hocr-editor-qt-screenshot.webp)

## install

### Linux

#### NixOS Linux

```
nix-shell -p git
git clone https://github.com/milahu/hocr-editor-qt
cd hocr-editor-qt
nix-shell
```

#### Debian Linux

```
sudo apt install git python3

git clone https://github.com/milahu/hocr-editor-qt
cd hocr-editor-qt
pip install -r requirements.txt
```

### Windows

install `git` and `python3` with the [chocolatey package manager](https://chocolatey.org/install)

powershell → rightclick → run as admin

```
choco install git python3
```

now in a non-admin powershell

```
git clone https://github.com/milahu/hocr-editor-qt
cd hocr-editor-qt
pip install -r requirements.txt
```