{"id":22161133,"url":"https://github.com/mr-destructive/ocr-playground","last_synced_at":"2025-07-25T22:36:14.197Z","repository":{"id":197003678,"uuid":"697792881","full_name":"Mr-Destructive/ocr-playground","owner":"Mr-Destructive","description":null,"archived":false,"fork":false,"pushed_at":"2023-09-28T13:53:53.000Z","size":9,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-29T20:37:18.547Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Mr-Destructive.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-09-28T13:41:11.000Z","updated_at":"2023-09-28T13:42:22.000Z","dependencies_parsed_at":null,"dependency_job_id":"e5b41d23-afe6-4b5e-8a62-e5a748726baf","html_url":"https://github.com/Mr-Destructive/ocr-playground","commit_stats":null,"previous_names":["mr-destructive/ocr-playground"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Mr-Destructive%2Focr-playground","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Mr-Destructive%2Focr-playground/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Mr-Destructive%2Focr-playground/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Mr-Destructive%2Focr-playground/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Mr-Destructive","download_url":"https://codeload.github.com/Mr-Destructive/ocr-playground/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245295620,"owners_count":20592072,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-02T04:13:17.775Z","updated_at":"2025-03-24T15:24:20.030Z","avatar_url":"https://github.com/Mr-Destructive.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## OCR Playground\n\n\u003e This is a side project / assigment for understanding and learning OCRs/text extraction.\n\n- Using [tesseract](https://tesseract-ocr.github.io/tessdoc/Installation.html) as the OCR.\n- Using [PILLOW](https://pillow.readthedocs.io/en/stable/) for image processing.\n- Using [flask](https://flask.palletsprojects.com/en/1.1.x/) for the API.\n\n\n### Endpoints\n\n- `/uploads`: to upload the image/pdf and get the OCR results.\n- `/extract`: get text of rows related to particular column.\n- `/rotate`: rotate the image/bounding box in the ocr provided with the angle.\n- `/boxes`: get the bounding boxes in the entire document as a image.\n\n### TODO\n\n- Exploration of various concepts like OCRs, IDPs, OCDs, etc.\n- Implement the rotation feature more accurately.\n- Extract with the entire text resembling the document structure.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmr-destructive%2Focr-playground","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmr-destructive%2Focr-playground","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmr-destructive%2Focr-playground/lists"}