{"id":13564520,"url":"https://github.com/yardstick17/image_text_reader","last_synced_at":"2026-01-21T09:03:52.864Z","repository":{"id":156310045,"uuid":"84854678","full_name":"yardstick17/image_text_reader","owner":"yardstick17","description":"The module extracts text from image using the tesseract-OCR engine. Generally, text present in the images are blur or are of uneven sizes. The image is pre-processed for better comprehension by OCR. This module first makes bounding box for text in images and then normalizes it to 300 dpi, suitable for OCR engine to read.","archived":false,"fork":false,"pushed_at":"2019-04-03T12:22:37.000Z","size":6616,"stargazers_count":147,"open_issues_count":5,"forks_count":43,"subscribers_count":12,"default_branch":"master","last_synced_at":"2025-04-03T21:35:40.472Z","etag":null,"topics":["image-reader","image-to-text","ocr","ocr-text-reader","read-image","tesseract-ocr"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/yardstick17.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-03-13T17:20:20.000Z","updated_at":"2025-04-01T20:38:16.000Z","dependencies_parsed_at":null,"dependency_job_id":"9ae7b959-04e3-457e-897c-1fd7e2a60b04","html_url":"https://github.com/yardstick17/image_text_reader","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/yardstick17/image_text_reader","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yardstick17%2Fimage_text_reader",
"tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yardstick17%2Fimage_text_reader/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yardstick17%2Fimage_text_reader/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yardstick17%2Fimage_text_reader/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/yardstick17","download_url":"https://codeload.github.com/yardstick17/image_text_reader/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yardstick17%2Fimage_text_reader/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28630938,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-21T04:47:28.174Z","status":"ssl_error","status_checked_at":"2026-01-21T04:47:22.943Z","response_time":86,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["image-reader","image-to-text","ocr","ocr-text-reader","read-image","tesseract-ocr"],"created_at":"2024-08-01T13:01:32.502Z","updated_at":"2026-01-21T09:03:52.846Z","avatar_url":"https://github.com/yardstick17.png","language":"Python","readme":"# 
image_text_reader\n[![Gitter](https://badges.gitter.im/yardstick17/image_text_reader.svg)](https://gitter.im/yardstick17/image_text_reader?utm_source=badge\u0026utm_medium=badge\u0026utm_campaign=pr-badge) [![Build Status](https://travis-ci.org/yardstick17/image_text_reader.svg?branch=master)](https://travis-ci.org/yardstick17/image_text_reader)\n\nIt's a very basic tool for reading text from images, particularly images formatted like a restaurant menu.\n\n### Tesseract-ocr\nThis tool needs the tesseract-ocr engine. See the installation guide:\n* https://github.com/tesseract-ocr/tesseract/wiki\n\n#### Linux\n\nTesseract is available directly from many Linux distributions. The package is generally called 'tesseract' or 'tesseract-ocr' - search your distribution's repositories to find it. For example, you can install Tesseract 4.x and its developer tools on Ubuntu 18.x bionic by running:\n```bash\nsudo apt install tesseract-ocr\nsudo apt install libtesseract-dev\n\n```\nRefer [here](https://github.com/tesseract-ocr/tesseract/wiki) for installation instructions on other systems.\n\n#### macOS\n\n##### Homebrew\n\nTo install Tesseract, run this command:\n\n```bash\nbrew install tesseract\n```\n\n\n###### OCR reads the text image extracted from the full image. [Click here](https://github.com/yardstick17/menu_parser/blob/master/boxed_image.jpg)\n\n## Commands to use:\n\n### Dockerized image reading\n\u003e docker run -it yardstick17/image-text-reader bash -c \"PYTHONPATH='.' python3 read_image.py read_text_from_local_image -f images/sample_image.jpg\"\n\n### Read from URL\n\n\u003e PYTHONPATH='.' 
python3 read_image.py read_text_from_image_url -u https://marketplace.canva.com/MACHUlPU93Q/1/0/thumbnail_large/canva-peach-green-leaves-garden-vegetarian-pizza-menu-MACHUlPU93Q.jpg\n\n```bash\n[2017-07-07 16:20:34,119] INFO : Downloading image from url: https://marketplace.canva.com/MACHUlPU93Q/1/0/thumbnail_large/canva-peach-green-leaves-garden-vegeta\n[2017-07-07 16:20:35,997] INFO : Saving file: /var/folders/cz/n3vkz7x91qs06nmm9byxxgz00000gr/T/tmpienrxu2c\n[2017-07-07 16:20:35,997] INFO : Processing image for text Extraction\n[2017-07-07 16:20:36,308] INFO : Removing noise and smoothening image\n[2017-07-07 16:20:36,431] INFO : Reading the text inside the contour plotted\n```\n\n\n### Read from local image\n\u003e PYTHONPATH='.' python3 read_image.py read_text_from_local_image -f images/sample_image.jpg\n```bash\n[2017-07-07 16:32:38,862] INFO : Processing image for text Extraction\n[2017-07-07 16:32:39,232] INFO : Removing noise and smoothening image\n[2017-07-07 16:32:39,442] INFO : Reading the text inside the contour plotted\n```\n\n\n### Deploy an API for reading text from images\n\u003e PYTHONPATH='.' python3 api/app.py\n```bash\n[2017-07-07 16:49:57,818] INFO :  * Running on http://0.0.0.0:6600/ (Press CTRL+C to quit)\n[2017-07-07 16:49:57,820] INFO :  * Restarting with stat\n[2017-07-07 16:49:58,712] WARNING :  * Debugger is active!\n[2017-07-07 16:49:58,738] INFO :  * Debugger pin code: 316-405-633\n```\n\n### Sample API deployed on my tiny server. 
Please be patient with it.\n```bash\ncurl -X POST \\\n  http://54.254.214.96/read_image_from_file/url \\\n  -F url=https://africatalentbank.com/wp-content/uploads/2014/10/Menu.jpg\n```\n\n### Digital Menu\n![Digital Image](images/digital_menu.jpg)\n\n### Original Image\n![Original Image](images/sample_image.jpg)\n","funding_links":[],"categories":["Python"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fyardstick17%2Fimage_text_reader","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fyardstick17%2Fimage_text_reader","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fyardstick17%2Fimage_text_reader/lists"}