{"id":19282868,"url":"https://github.com/ocrmypdf/ocrmypdf-easyocr","last_synced_at":"2025-04-10T04:55:52.019Z","repository":{"id":180352010,"uuid":"664999509","full_name":"ocrmypdf/OCRmyPDF-EasyOCR","owner":"ocrmypdf","description":"OCRmyPDF EasyOCR plugin","archived":false,"fork":false,"pushed_at":"2025-04-02T06:22:42.000Z","size":172,"stargazers_count":73,"open_issues_count":5,"forks_count":13,"subscribers_count":6,"default_branch":"main","last_synced_at":"2025-04-02T07:25:33.612Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ocrmypdf.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-11T08:08:57.000Z","updated_at":"2025-04-02T06:22:46.000Z","dependencies_parsed_at":"2024-03-23T21:34:52.245Z","dependency_job_id":"c087fdec-bfb9-4429-b11e-59acec73c1a2","html_url":"https://github.com/ocrmypdf/OCRmyPDF-EasyOCR","commit_stats":null,"previous_names":["ocrmypdf/ocrmypdf-easyocr"],"tags_count":3,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ocrmypdf%2FOCRmyPDF-EasyOCR","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ocrmypdf%2FOCRmyPDF-EasyOCR/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ocrmypdf%2FOCRmyPDF-EasyOCR/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ocrmypdf%2FOCRmyPDF-EasyOCR/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ocrmypdf","download_url":"https://codeload.github.com/ocrmypdf/OCRmyPDF-EasyOCR/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248161258,"owners_count":21057554,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-09T21:28:41.780Z","updated_at":"2025-04-10T04:55:51.993Z","avatar_url":"https://github.com/ocrmypdf.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# OCRmyPDF EasyOCR\n\nThis is plugin to run OCRmyPDF with the EasyOCR engine instead of Tesseract OCR,\nthe default OCR engine for OCRmyPDF. Since EasyOCR is based on PyTorch, it makes\nuse of Nvidia GPUs. Hopefully it will be more performant and accurate than Tesseract OCR.\n\nIt is currently experimental and does not implement all of the features of\nOCRmyPDF with Tesseract, and still relies on Tesseract for certain operations.\n\n## Installation\n\nTo use this plugin, first\n[install PyTorch according to the official instructions](https://pytorch.org/),\nwhich may differ for your platform.\n\nThen install OCRmyPDF-EasyOCR to the same virtual environment or conda environment\nas you installed PyTorch:\n\n```bash\npip install git+https://github.com/ocrmypdf/OCRmyPDF-EasyOCR.git\n```\n\nThe OCRmyPDF-EasyOCR will override Tesseract for OCR; however, OCR still depends\non Tesseract for some tasks.\n\nIf [Celery's multiprocessing](https://docs.celeryq.dev/en/stable/getting-started/introduction.html)\nis installed in the virtual environment, it will be used instead of the standard\nPython multiprocessing. This allows paperless-ngx, which uses Celery, to function correctly.\n\n## Troubleshooting\n\nIf you see a log message\n``Neither CUDA nor MPS are available - defaulting to CPU. Note: This module is much faster with a GPU``\nthen PyTorch is not installed.\n\n## To do\n\nContributions, especially pull requests are quite welcome!\n\nAt the moment this plugin is alpha status and missing some essential features:\n- Tesseract is still required for determine page skew and for orientation correction\n- EasyOCR is effectively single threaded, to eliminate race conditions\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Focrmypdf%2Focrmypdf-easyocr","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Focrmypdf%2Focrmypdf-easyocr","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Focrmypdf%2Focrmypdf-easyocr/lists"}