{"id":19607143,"url":"https://github.com/robocorp/example-parse-pdf-invoice","last_synced_at":"2025-07-02T15:36:15.454Z","repository":{"id":103903264,"uuid":"595528814","full_name":"robocorp/example-parse-pdf-invoice","owner":"robocorp","description":"Extract information from PDF invoices","archived":false,"fork":false,"pushed_at":"2024-02-07T08:26:04.000Z","size":211,"stargazers_count":2,"open_issues_count":0,"forks_count":1,"subscribers_count":14,"default_branch":"master","last_synced_at":"2025-02-26T16:50:19.802Z","etag":null,"topics":["ai","library","pdf","rpaframework","text"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/robocorp.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-01-31T09:14:27.000Z","updated_at":"2024-04-01T13:05:21.000Z","dependencies_parsed_at":"2025-01-10T08:45:47.047Z","dependency_job_id":null,"html_url":"https://github.com/robocorp/example-parse-pdf-invoice","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/robocorp/example-parse-pdf-invoice","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-parse-pdf-invoice","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-parse-pdf-invoice/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-parse-pdf-invoice/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-parse-pdf-invoice/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/robocorp","download_url":"https://codeload.github.com/robocorp/example-parse-pdf-invoice/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/robocorp%2Fexample-parse-pdf-invoice/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263166734,"owners_count":23424230,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","library","pdf","rpaframework","text"],"created_at":"2024-11-11T10:09:05.941Z","updated_at":"2025-07-02T15:36:15.428Z","avatar_url":"https://github.com/robocorp.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Extract data from PDF files displaying invoice like information\n\nShow-case multiple ways of extracting information from different kinds of PDF files\n(text based or scans), mainly presenting invoice data.\n\nRead more on the\n[challenges](https://pypdf.readthedocs.io/en/latest/user/extract-text.html) of getting\ninformation out of PDF files.\n\n## Tasks\n\n### Extract Text Data\n\nExtract textual data from a PDF file.\n\n\u003e Usually this is sufficient for most of the cases.\n\n\n### Extract element from table in PDF\n\nIn some cases, it may be easier to find the elements and their neighbours instead of just parsing the text. In this example we find rows and columns from a table in a PDF document.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobocorp%2Fexample-parse-pdf-invoice","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frobocorp%2Fexample-parse-pdf-invoice","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frobocorp%2Fexample-parse-pdf-invoice/lists"}