{"id":26280791,"url":"https://github.com/tattersoftware/wp-tesseract","last_synced_at":"2026-05-22T14:14:15.824Z","repository":{"id":35670346,"uuid":"216213558","full_name":"tattersoftware/wp-tesseract","owner":"tattersoftware","description":"A WordPress OCR plugin for extracting the text from attached images using Tesseract.","archived":false,"fork":false,"pushed_at":"2023-09-09T10:25:21.000Z","size":828,"stargazers_count":1,"open_issues_count":1,"forks_count":1,"subscribers_count":2,"default_branch":"develop","last_synced_at":"2024-05-01T09:41:20.311Z","etag":null,"topics":["ocr","optical-text-recognition","tesseract","wordpress","wordpress-plugin"],"latest_commit_sha":null,"homepage":"","language":"PHP","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tattersoftware.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-10-19T13:48:58.000Z","updated_at":"2022-03-05T13:00:55.000Z","dependencies_parsed_at":"2022-08-08T10:16:07.212Z","dependency_job_id":null,"html_url":"https://github.com/tattersoftware/wp-tesseract","commit_stats":null,"previous_names":[],"tags_count":6,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tattersoftware%2Fwp-tesseract","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tattersoftware%2Fwp-tesseract/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tattersoftware%2Fwp-tesseract/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tattersoftware%2Fwp-tesseract/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tattersoftware","download_url":"https://codeload.github.com/tattersoftware/wp-tesseract/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243597836,"owners_count":20316845,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ocr","optical-text-recognition","tesseract","wordpress","wordpress-plugin"],"created_at":"2025-03-14T15:17:51.335Z","updated_at":"2026-05-22T14:14:10.802Z","avatar_url":"https://github.com/tattersoftware.png","language":"PHP","funding_links":[],"categories":[],"sub_categories":[],"readme":"# WP Tesseract  \n\nWordPress Plugin for Tesseract.\n\n## Description \n\nA plugin for extracting text from attached images using\n[OCR](http://en.wikipedia.org/wiki/Optical_character_recognition) via [Tesseract](https://github.com/tesseract-ocr/).\nThis plugin adds a new post named for each image upload containing any recognized text characters within the file.\nThis text can then be edited for accuracy and used elsewhere on the site.\n\nThe OCR plugin requires a supported version of PHP with the GD extension and the following command line utility:\n* [Tesseract](https://github.com/tesseract-ocr/) for the actual OCR\nThis utility must be manually installed on your server and executable by PHP.\n**This process, and consequently this plugin, is recommended only for advanced users.**\n\n\n## Installation\n\n1. Install Tesseract OCR on your server ([Tesseract wiki](https://github.com/tesseract-ocr/tesseract/wiki/))\n2. Search and add the plugin from WordPress, or upload a copy of the source to your `/wp-content/plugins/` directory\n3. Activate the plugin through the `Plugins` menu in WordPress\n4. Configure the plugin through the `Settings \u003e Tesseract` link in the sidebar menu in WordPress\n\n\n## Frequently Asked Questions\n\n### What is Tesseract OCR and where do I get it?\n\nTesseract OCR is an open source [optical character recognition](http://en.wikipedia.org/wiki/Optical_character_recognition)\nlibrary that the WordPress OCR plugin uses to extract text from images. The library as\nwell as installation instructions can be found at\n[https://github.com/tesseract-ocr/tesseract/wiki/](https://github.com/tesseract-ocr/tesseract/wiki/).\n\n### How do I know if / where I have Tesseract installed on my server?\n\nLinux:\n\n1. SSH into your server and type `which tesseract`.\n2. If Tesseract is installed and in your shell environment PATH the terminal should return a path similar to `/opt/local/bin/tesseract`.\n3. Place this path in the configuration of the OCR plugin through the `Settings \u003e Tesseract` link in the sidebar menu in WordPress\n\n### Where is the detected text stored? \n\nThe text detected by the OCR plugin is added as a new post, named after the image file.\n\n### What is the 'Resize percentage' configuration option?\n\nThe OCR plugin is tailored to detect text in images with ~12pt text at 72dpi. GD\nis used to upscale the temporary images fed to Tesseract as Tesseract is generally\nmore accurate with larger type, even if it's been upscaled from a smaller source. If you\nwish to disable this option simply set this configuration option to `100%` and no resizing\nwill occur.\n\n### What if I just want to use the plugin but not install anything?\n\nHosting options are available. See [https://tattersoftware.com](https://tattersoftware.com)\nfor contact info.\n\n## Attribution\n\nThe plugin's banner photo is by Ekrulila from Pexels.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftattersoftware%2Fwp-tesseract","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftattersoftware%2Fwp-tesseract","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftattersoftware%2Fwp-tesseract/lists"}