Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ReceiptManager/receipt-parser-legacy
A supermarket receipt parser written in Python using tesseract OCR
https://github.com/ReceiptManager/receipt-parser-legacy
home-assistant invoice ocr receipt receipt-parser supermarket
Last synced: 10 days ago
JSON representation
A supermarket receipt parser written in Python using tesseract OCR
- Host: GitHub
- URL: https://github.com/ReceiptManager/receipt-parser-legacy
- Owner: ReceiptManager
- License: apache-2.0
- Created: 2015-10-02T13:42:09.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2024-08-28T15:25:41.000Z (3 months ago)
- Last Synced: 2024-10-29T17:28:32.610Z (15 days ago)
- Topics: home-assistant, invoice, ocr, receipt, receipt-parser, supermarket
- Language: Python
- Homepage: https://tech.trivago.com/2015/10/06/python_receipt_parser/
- Size: 10.6 MB
- Stars: 813
- Watchers: 32
- Forks: 196
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# A fuzzy receipt parser written in Python
This is a fuzzy receipt parser written in Python.
It extracts information like the shop, the date, and the total from scanned receipts.
It can work as a standalone script or as part of our [IOS and Android application](https://github.com/ReceiptManager/Application).## Dependencies
The `receipt-parser-core` library depend on `imagemagick`. Please install `imagemagick`
with your favorite package manager.## Usage
To convert all images from the `data/img/` folder to text using tesseract and parse the resulting text files, run```
make run
```### Docker
A `Dockerfile` is available with all dependencies needed to run the program.
To build the image, run```
make docker-build
```To run it on the sample files, try
```
make docker-run
```By default, running the image will execute the `make run` command. To use with your own images, run the following:
```
docker run -v :/app/data/img mre0/receipt_parser
```## History
This project started as a hackathon idea. Read more about it on the [trivago techblog](https://tech.trivago.com/2015/10/06/python_receipt_parser/).
Also read the comments on [HackerNews](https://news.ycombinator.com/item?id=10338199)
There's also a [talk](https://www.youtube.com/watch?v=TuDeUsIlJz4) about the project.
The library is now available at [PyPi](https://pypi.org/project/receipt-parser-core/#description).