Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wintermi/ocr-runner
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
https://github.com/wintermi/ocr-runner
cloud-vision cloud-vision-api document-ai google-cloud google-cloud-platform
Last synced: 7 days ago
JSON representation
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
- Host: GitHub
- URL: https://github.com/wintermi/ocr-runner
- Owner: wintermi
- License: apache-2.0
- Created: 2023-06-12T08:04:39.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-16T14:24:08.000Z (3 months ago)
- Last Synced: 2024-08-16T15:50:53.230Z (3 months ago)
- Topics: cloud-vision, cloud-vision-api, document-ai, google-cloud, google-cloud-platform
- Language: Go
- Homepage:
- Size: 217 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# OCR Runner
[![Workflows](https://github.com/wintermi/ocr-runner/workflows/Go/badge.svg)](https://github.com/wintermi/ocr-runner/actions)
[![Go Report](https://goreportcard.com/badge/github.com/wintermi/ocr-runner)](https://goreportcard.com/report/github.com/wintermi/ocr-runner)
[![License](https://img.shields.io/github/license/wintermi/ocr-runner)](https://github.com/wintermi/ocr-runner/blob/main/LICENSE)
[![Release](https://img.shields.io/github/v/release/wintermi/ocr-runner?include_prereleases)](https://github.com/wintermi/ocr-runner/releases)## Description
A command line application designed to recursively walk through the input path submitting all image files for optical character recognition (OCR) via either the Google Cloud Vision API or a Google Cloud Document AI processor if a prediction endpoint is provided. The application will then output the image information and annotations to a single newline delimited JSON File.
```
USAGE:
ocr-runner -i PATH -o FILEARGS:
-endpoint string
Document AI Prediction Endpoint (Optional)
-full
Output full details to JSON
-i string
Input Path (Required)
-o string
Output File (Required)
-verbose
Display verbose or debug detail
```## Valid File Extensions
The application will automatically filter out all files that do not have one of the following extensions:
- `.bmp`
- `.gif`
- `.jpg`
- `.jpeg`
- `.pdf`
- `.png`
- `.tif`
- `.tiff`
- `.webp`## License
**ocr-runner** is released under the [Apache License 2.0](https://github.com/wintermi/ocr-runner/blob/main/LICENSE) unless explicitly mentioned in the file header.