{"id":19811351,"url":"https://github.com/diging/innogen-script","last_synced_at":"2026-06-09T03:32:31.504Z","repository":{"id":68005489,"uuid":"557451918","full_name":"diging/innogen-script","owner":"diging","description":"Python script to extract images from scanned documents using OpenCV","archived":false,"fork":false,"pushed_at":"2022-10-25T21:34:07.000Z","size":5,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-02-28T18:46:40.230Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/diging.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-10-25T18:02:46.000Z","updated_at":"2022-10-25T18:19:14.000Z","dependencies_parsed_at":"2023-04-27T19:03:10.572Z","dependency_job_id":null,"html_url":"https://github.com/diging/innogen-script","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/diging/innogen-script","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/diging%2Finnogen-script","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/diging%2Finnogen-script/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/diging%2Finnogen-script/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/diging%2Finnogen-script/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/diging","download_url":"https://codeload.github.com/diging/innogen-script/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/diging%2Finnogen-script/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34090751,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-09T02:00:06.510Z","response_time":63,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-12T09:25:57.139Z","updated_at":"2026-06-09T03:32:31.465Z","avatar_url":"https://github.com/diging.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Extract images from Scans\n\nThis script uses OpenCV to find and extract images from scans. To run, clone repo, and then build and run the Docker container.\n\nFor example, if your image is in a subfolder `images`:\n```\ndocker build -t extract_imgs .\ndocker run --mount type=bind,source=\"$(pwd)\",target=/data extract_imgs -f /data/images/file.jpg -o /data/images/extracted/\n```\nThe extracted images will be in `images/extracted/extracted`.\n\nThe build step will take quite a bit of time, while OpenCV is being built.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdiging%2Finnogen-script","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdiging%2Finnogen-script","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdiging%2Finnogen-script/lists"}