{"id":21295689,"url":"https://github.com/tjkessler/tesseract-positional","last_synced_at":"2025-03-15T17:14:02.008Z","repository":{"id":57474381,"uuid":"185096670","full_name":"tjkessler/tesseract-positional","owner":"tjkessler","description":"Tool to save positional OCR data to a text file","archived":false,"fork":false,"pushed_at":"2023-08-02T22:07:18.000Z","size":331,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-22T15:02:42.084Z","etag":null,"topics":["ocr-python","ocr-recognition","ocr-text-reader","tesseract","tesseract-ocr"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tjkessler.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-05-06T00:11:44.000Z","updated_at":"2023-08-02T22:02:51.000Z","dependencies_parsed_at":"2025-01-22T06:47:29.844Z","dependency_job_id":"5ae0dde5-2f1c-4f27-b158-1937400d2008","html_url":"https://github.com/tjkessler/tesseract-positional","commit_stats":null,"previous_names":["tjkessler/tesseract_positional"],"tags_count":3,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tjkessler%2Ftesseract-positional","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tjkessler%2Ftesseract-positional/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tjkessler%2Ftesseract-positional/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tjkessler%2Ftesseract-positional/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tjkessler","download_url":"https://codeload.github.com/tjkessler/tesseract-positional/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243762267,"owners_count":20343979,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ocr-python","ocr-recognition","ocr-text-reader","tesseract","tesseract-ocr"],"created_at":"2024-11-21T14:06:49.433Z","updated_at":"2025-03-15T17:14:01.986Z","avatar_url":"https://github.com/tjkessler.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Tesseract-Positional\nA tool to save positional OCR data to a text file\n\n[![GitHub version](https://badge.fury.io/gh/tjkessler%2Ftesseract-positional.svg)](https://badge.fury.io/gh/tjkessler%2Ftesseract-positional)\n[![PyPI version](https://badge.fury.io/py/tesseract-positional.svg)](https://badge.fury.io/py/tesseract-positional)\n[![GitHub license](https://img.shields.io/badge/license-MIT-blue.svg)](https://raw.githubusercontent.com/tjkessler/tesseract-positional/master/LICENSE.txt)\n\nTesseract-Positional allows positional data extracted using OCR to be saved as plain-text. Positional data includes text spacing and line breaks.\n\n## Installation\n\nInstallation via pip:\n\n```\n$ pip install tesseract-positional\n```\n\nInstallation via cloned repository:\n\n```\n$ git clone https://github.com/tjkessler/tesseract-positional\n$ cd tesseract-positional\n$ pip install .\n```\n\nAdditional dependencies (pytesseract, Pillow) will be installed during Tesseract-Positional's installation.\n\n## Basic Usage\n\n### Via a Python script\n\nSaving extracted text to a file:\n\n```python\nfrom tesseract_positional import positional_ocr\npositional_ocr('image.tiff', 'output.txt')\n```\n\nObtaining extracted text:\n\n```python\nfrom tesseract_positional import positional_ocr\ntext = positional_ocr('image.tiff')\n```\n\n### Via the command line\n\n```\n$ tesseract-positional image.tiff output.txt\n```\n\n## Contributing, Reporting Issues and Other Support\n\nTo contribute to Tesseract-Positional, make a pull request. Contributions should include tests for new features added, as well as extensive documentation.\n\nTo report problems with the software or feature requests, file an issue. When reporting problems, include information such as error messages, your OS/environment and Python version.\n\nFor additional support/questions, contact Travis Kessler (travis.j.kessler@gmail.com).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftjkessler%2Ftesseract-positional","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftjkessler%2Ftesseract-positional","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftjkessler%2Ftesseract-positional/lists"}