{"id":16133814,"url":"https://github.com/wallarug/pytesser","last_synced_at":"2026-02-16T10:35:48.038Z","repository":{"id":72814137,"uuid":"54962959","full_name":"wallarug/pytesser","owner":"wallarug","description":"Automatically exported from code.google.com/p/pytesser","archived":false,"fork":false,"pushed_at":"2016-03-29T09:27:08.000Z","size":1870,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-08T01:43:28.049Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/wallarug.png","metadata":{"files":{"readme":"README","changelog":"ChangeLog","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS","dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-03-29T09:24:36.000Z","updated_at":"2016-03-29T09:25:18.000Z","dependencies_parsed_at":"2023-02-23T12:31:02.546Z","dependency_job_id":null,"html_url":"https://github.com/wallarug/pytesser","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wallarug%2Fpytesser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wallarug%2Fpytesser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wallarug%2Fpytesser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wallarug%2Fpytesser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/wallarug","download_url":"https://codeload.github.com/wallarug/pytesser/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246802555,"owners_count":20836365,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-09T22:46:04.161Z","updated_at":"2025-10-05T12:07:54.944Z","avatar_url":"https://github.com/wallarug.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"Introduction:\r\n============\r\nPyTesser is an Optical Character Recognition module for Python. It takes \r\nas input an image or image file and outputs a string.\r\n\r\nPyTesser uses the Tesseract OCR engine (an Open Source project at Google), \r\nconverting images to an accepted format and calling the Tesseract \r\nexecutable as an external script. A Windows executable is provided \r\nalong with the Python scripts. The scripts should work in Linux as well. \r\n\r\nPyTesser:\r\nhttp://code.google.com/p/pytesser/\r\nTesseract:\r\nhttp://code.google.com/p/tesseract-ocr/\r\n\r\n\r\nDependencies:\r\n=============\r\nPIL is required to work with images in memory. PyTesser has been tested with Python 2.4 in Windows XP. \r\nhttp://www.pythonware.com/products/pil/\r\n\r\n\r\nInstallation:\r\n==============\r\nPyTesser has no installation functionality in this release.  Extract pytesser.zip\r\ninto directory with other scripts.  Necessary files are listed in File Dependencies below.  \r\n\r\n\r\nUsage:\r\n================================\r\n\u003e\u003e\u003e from pytesser import *\n\u003e\u003e\u003e im = Image.open('phototest.tif')\n\u003e\u003e\u003e text = image_to_string(im)\n\u003e\u003e\u003e print text\nThis is a lot of 12 point text to test the\nocr code and see if it works on all types\nof file format.\nThe quick brown dog jumped over the\nlazy fox. The quick brown dog jumped\nover the lazy fox. The quick brown dog\njumped over the lazy fox. The quick\nbrown dog jumped over the lazy fox.\n\n\u003e\u003e\u003e try:\n... \ttext = image_file_to_string('fnord.tif', graceful_errors=False)\n... except errors.Tesser_General_Exception, value:\n... \tprint \"fnord.tif is incompatible filetype.  Try graceful_errors=True\"\n... \tprint value\n... \t\nfnord.tif is incompatible filetype.  Try graceful_errors=True\nTesseract Open Source OCR Engine\nread_tif_image:Error:Illegal image format:Compression\nTessedit:Error:Read of file failed:fnord.tif\nSignal_exit 31 ABORT. LocCode: 3  AbortCode: 3\n\n\u003e\u003e\u003e text = image_file_to_string('fnord.tif', graceful_errors=True)\n\u003e\u003e\u003e print \"fnord.tif contents:\", text\nfnord.tif contents: fnord\n\n\u003e\u003e\u003e text = image_file_to_string('fonts_test.png', graceful_errors=True)\n\u003e\u003e\u003e print text\n12 pt\nAnd Arnazwngw few dwscotheques provwde jukeboxes\nTames Amazmgly few dnscotheques pmvxde Jukeboxes\n24 pt:\nArial: Amazingly few discotheques\nprovide jul\u003cebo\u003e\u003ces.\nCourier: Ama zimgly few\ndiscotheque S provide\nj u k e b ox e S .\nTimes: Amazingly few discotheques provide\njukeboxes.\r\n\r\n\r\nFile Dependencies:\r\n============================================\r\npytesser.py\tMain module for importing\r\nutil.py\t\tUtility functions used by pytesser.py\r\nerrors.py\tInterprets exceptions thrown by Tesseract\r\ntesseract.exe\tExecutable called by pytesser.py\r\ntessdata/\tResources used by tesseract.exe\r\n\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwallarug%2Fpytesser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwallarug%2Fpytesser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwallarug%2Fpytesser/lists"}