{"id":13543144,"url":"https://github.com/ReceiptManager/receipt-parser-legacy","last_synced_at":"2025-04-02T12:31:07.822Z","repository":{"id":37722416,"uuid":"43554926","full_name":"ReceiptManager/receipt-parser-legacy","owner":"ReceiptManager","description":"A supermarket receipt parser written in Python using tesseract OCR","archived":false,"fork":false,"pushed_at":"2024-08-28T15:25:41.000Z","size":11101,"stargazers_count":829,"open_issues_count":11,"forks_count":196,"subscribers_count":31,"default_branch":"master","last_synced_at":"2025-03-28T14:18:27.126Z","etag":null,"topics":["home-assistant","invoice","ocr","receipt","receipt-parser","supermarket"],"latest_commit_sha":null,"homepage":"https://tech.trivago.com/2015/10/06/python_receipt_parser/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ReceiptManager.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"open_collective":"receiptmanager"}},"created_at":"2015-10-02T13:42:09.000Z","updated_at":"2025-03-21T09:50:09.000Z","dependencies_parsed_at":"2023-11-08T05:52:39.033Z","dependency_job_id":"f86d1422-6114-455a-bcac-c5d6950a53d3","html_url":"https://github.com/ReceiptManager/receipt-parser-legacy","commit_stats":{"total_commits":254,"total_committers":28,"mean_commits":9.071428571428571,"dds":0.6732283464566929,"last_synced_commit":"ea17264147ce9b8ede5ddbbf2ef75c514a2e57bf"},"previous_names":["receiptmanager/receipt-parser"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ReceiptManager%2Freceipt-parser-legacy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ReceiptManager%2Freceipt-parser-legacy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ReceiptManager%2Freceipt-parser-legacy/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ReceiptManager%2Freceipt-parser-legacy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ReceiptManager","download_url":"https://codeload.github.com/ReceiptManager/receipt-parser-legacy/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246815456,"owners_count":20838441,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["home-assistant","invoice","ocr","receipt","receipt-parser","supermarket"],"created_at":"2024-08-01T11:00:24.082Z","updated_at":"2025-04-02T12:31:07.788Z","avatar_url":"https://github.com/ReceiptManager.png","language":"Python","funding_links":["https://opencollective.com/receiptmanager"],"categories":["Text detection and localization","Python"],"sub_categories":["Form Segmentation"],"readme":"# A fuzzy receipt parser written in Python  \n\nThis is a fuzzy receipt parser written in Python. \nIt extracts information like the shop, the date, and the total from scanned receipts.\nIt can work as a standalone script or as part of our [IOS and Android application](https://github.com/ReceiptManager/Application).\n\n## Dependencies\nThe `receipt-parser-core` library depend on `imagemagick`. Please install `imagemagick`\nwith your favorite package manager.\n\n## Usage\nTo convert all images from the `data/img/` folder to text using tesseract and parse the resulting text files, run\n\n```\nmake run\n```\n\n### Docker\n\nA `Dockerfile` is available with all dependencies needed to run the program.  \nTo build the image, run\n\n```\nmake docker-build\n```\n\nTo run it on the sample files, try\n\n```\nmake docker-run\n```\n\nBy default, running the image will execute the `make run` command. To use with your own images, run the following:\n\n```\ndocker run -v \u003cpath_to_input_images\u003e:/app/data/img mre0/receipt_parser\n```\n\n## History\n\nThis project started as a hackathon idea. Read more about it on the [trivago techblog](https://tech.trivago.com/2015/10/06/python_receipt_parser/).\nAlso read the comments on [HackerNews](https://news.ycombinator.com/item?id=10338199)\nThere's also a [talk](https://www.youtube.com/watch?v=TuDeUsIlJz4) about the project.\nThe library is now available at [PyPi](https://pypi.org/project/receipt-parser-core/#description).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FReceiptManager%2Freceipt-parser-legacy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FReceiptManager%2Freceipt-parser-legacy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FReceiptManager%2Freceipt-parser-legacy/lists"}