https://github.com/wintermi/ocr-runner
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
https://github.com/wintermi/ocr-runner
cloud-vision cloud-vision-api document-ai google-cloud google-cloud-platform
Last synced: 2 months ago
JSON representation
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
- Host: GitHub
- URL: https://github.com/wintermi/ocr-runner
- Owner: wintermi
- License: apache-2.0
- Created: 2023-06-12T08:04:39.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2025-04-18T12:18:37.000Z (3 months ago)
- Last Synced: 2025-04-27T00:26:17.247Z (3 months ago)
- Topics: cloud-vision, cloud-vision-api, document-ai, google-cloud, google-cloud-platform
- Language: Go
- Homepage:
- Size: 272 KB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# OCR Runner
[](https://github.com/wintermi/ocr-runner/actions)
[](https://goreportcard.com/report/github.com/wintermi/ocr-runner)
[](https://github.com/wintermi/ocr-runner/blob/main/LICENSE)
[](https://github.com/wintermi/ocr-runner/releases)## Description
A command line application designed to recursively walk through the input path submitting all image files for optical character recognition (OCR) via either the Google Cloud Vision API or a Google Cloud Document AI processor if a prediction endpoint is provided. The application will then output the image information and annotations to a single newline delimited JSON File.
```
USAGE:
ocr-runner -i PATH -o FILEARGS:
-endpoint string
Document AI Prediction Endpoint (Optional)
-full
Output full details to JSON
-i string
Input Path (Required)
-o string
Output File (Required)
-verbose
Display verbose or debug detail
```## Valid File Extensions
The application will automatically filter out all files that do not have one of the following extensions:
- `.bmp`
- `.gif`
- `.jpg`
- `.jpeg`
- `.pdf`
- `.png`
- `.tif`
- `.tiff`
- `.webp`## License
**ocr-runner** is released under the [Apache License 2.0](https://github.com/wintermi/ocr-runner/blob/main/LICENSE) unless explicitly mentioned in the file header.