https://github.com/kamui-fin/gazou
Japanese and Chinese OCR for Linux & Windows
- Host: GitHub
- URL: https://github.com/kamui-fin/gazou
- Owner: kamui-fin
- License: gpl-3.0
- Created: 2020-08-28T20:10:57.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-10-20T03:49:40.000Z (over 1 year ago)
- Last Synced: 2025-04-14T23:55:44.086Z (2 months ago)
- Topics: imageprocessing, japanese, ocr
- Language: C++
- Homepage:
- Size: 70.7 MB
- Stars: 96
- Watchers: 3
- Forks: 13
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
# Gazou OCR
Gazou is a Japanese and Chinese OCR application written in C++. It uses image processing and other parameters to improve recognition accuracy. All contributions are welcome.
## Installation
### Windows
You can grab the latest portable zip build from the [releases](https://github.com/kamui-fin/gazou/releases) page. Unzip it and run `gazou.exe`.
### Arch
Gazou is packaged in the AUR, so you can install it with any AUR helper, for example:
```
paru -S gazou-git
```
## Building from source
### Dependencies
Install these dependencies with your system's package manager:
- Qt5 >= 5.10
- Tesseract >= 4.0.0
- Leptonica >= 1.70

Optional dependencies:
- Qt5X11Extras >= 5.10 (for GUI)
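
Before configuring the build, it can help to confirm that the installed libraries meet these minimum versions. The sketch below assumes the libraries ship `pkg-config` metadata under the module names `Qt5Core`, `tesseract`, `lept`, and `Qt5X11Extras`, which is typical but may differ on your distribution. `check_dep` and `check_gazou_deps` are hypothetical helper names, not part of gazou.

```sh
# Check that a pkg-config module is present at a minimum version.
# PKG_CONFIG may be set to point at a different pkg-config binary.
check_dep() {
    module=$1
    min_version=$2
    if ${PKG_CONFIG:-pkg-config} --atleast-version="$min_version" "$module" 2>/dev/null; then
        echo "ok: $module >= $min_version"
    else
        echo "missing or too old: $module (need >= $min_version)"
        return 1
    fi
}

# Check every dependency listed above (module names are assumptions).
check_gazou_deps() {
    rc=0
    check_dep Qt5Core 5.10 || rc=1
    check_dep tesseract 4.0.0 || rc=1
    check_dep lept 1.70 || rc=1
    # Optional: only needed when building with -DGUI=ON
    check_dep Qt5X11Extras 5.10 || echo "note: GUI build will not be possible"
    return $rc
}
```

Source the snippet and run `check_gazou_deps`. A non-zero exit status means at least one required dependency is missing or too old.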
### Install
```sh
git clone --recursive https://github.com/kamui-fin/gazou.git
cd gazou
mkdir build
cd build
cmake .. -DGUI=ON
sudo make install
```
#### Wayland
On Wayland, `gazou` must be compiled with `-DGUI=OFF`, and you use the equally functional command line interface instead.
This is because Wayland, as a security measure, does not allow applications to register their own global hotkeys.
## GUI
You can run the program by typing `gazou` in your terminal. It runs in the background and should appear in your system tray.
If you click on the app in the tray, a settings dialog should appear.
In here, you can customize the keybinds to your liking. The default hotkeys are:
- `Alt+A`: Vertical OCR
- `Alt+D`: Horizontal OCR
- `Alt+S`: Repeat the previous OCR

When you perform an OCR, the result text gets copied to the clipboard.
Just to clarify the language option names:
- `jpn`: Japanese
- `chi_sim`: Simplified Chinese
- `chi_trad`: Traditional Chinese
## CLI
Gazou also has a command line mode, and this can be useful for integrating it with bash scripts.
To get the resulting text copied to your clipboard, you can use `xclip`, `wl-copy`, or any clipboard utility you prefer.
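
As a sketch of that integration, the function below runs the CLI and pipes its output into whichever clipboard tool is available. It assumes `gazou` prints the recognized text to standard output when given an image path. `ocr_to_clipboard`, `pick_clipboard`, and the `GAZOU_CLIP` override variable are hypothetical names introduced for illustration.

```sh
# Print the clipboard command to use.
# GAZOU_CLIP is a hypothetical override, not something gazou itself reads.
pick_clipboard() {
    if [ -n "${GAZOU_CLIP:-}" ]; then
        printf '%s\n' "$GAZOU_CLIP"
    elif [ -n "${WAYLAND_DISPLAY:-}" ] && command -v wl-copy >/dev/null 2>&1; then
        printf '%s\n' "wl-copy"
    elif command -v xclip >/dev/null 2>&1; then
        printf '%s\n' "xclip -selection clipboard"
    else
        # No clipboard tool found: fall back to printing the text.
        printf '%s\n' "cat"
    fi
}

# OCR an image and send the text to the clipboard.
# Usage: ocr_to_clipboard IMAGE [LANGUAGE]   (LANGUAGE defaults to jpn)
ocr_to_clipboard() {
    image=$1
    lang=${2:-jpn}
    # The command substitution is left unquoted on purpose so that
    # "xclip -selection clipboard" splits into a command and its arguments.
    gazou -l "$lang" "$image" | $(pick_clipboard)
}
```

For example, `ocr_to_clipboard page.png chi_sim` would OCR `page.png` as Simplified Chinese.
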
Available options include:
```
Usage: gazou [options] imagePath
Launches GUI if no options are provided.

Options:
  -p, --prevscan  Run the OCR on the same coordinates of the
                  previous scan
  -l, --language  Specify OCR language, defaults to jpn. Options:
                  jpn, chi_sim, chi_trad
  -v, --vertical  Switch orientation to vertical. Without this,
                  gazou expects horizontal text.
  --version       Fetch the version information of gazou
  --help          View this help menu

Arguments:
  imagePath       Source image file to OCR
```
### Piping from stdin
You can also run gazou by piping an image into the CLI:
```
cat img.png | gazou
```
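
The same approach should work with any tool that writes an image to standard output. The sketch below assumes a screenshot utility is installed (`maim` on X11, or `grim` with `slurp` on Wayland; none of these are part of gazou) and that `gazou` accepts its usual options when reading from stdin, which this README does not state explicitly. `ocr_selection` is a hypothetical helper name.

```sh
# Let the user select a screen region, then OCR it.
# Any arguments are passed through to gazou (for example -v or -l chi_sim).
ocr_selection() {
    if [ -n "${WAYLAND_DISPLAY:-}" ]; then
        # grim writes the PNG to stdout when the output file is "-"
        grim -g "$(slurp)" - | gazou "$@"
    else
        # maim writes the PNG to stdout when no output file is given
        maim -s | gazou "$@"
    fi
}
```

On Wayland, where the GUI hotkeys are unavailable, a function like this could be bound to a compositor keybinding, with its output piped to `wl-copy`.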