https://github.com/wavemaker/prefab-entity-extractor
Extract data from invoices, receipts, PDF documents, screenshots of documents
https://github.com/wavemaker/prefab-entity-extractor
prefab wavemaker-prefab
Last synced: 10 days ago
JSON representation
Extract data from invoices, receipts, PDF documents, screenshots of documents
- Host: GitHub
- URL: https://github.com/wavemaker/prefab-entity-extractor
- Owner: wavemaker
- License: apache-2.0
- Created: 2020-07-29T12:22:20.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-09-07T05:15:09.000Z (over 5 years ago)
- Last Synced: 2025-03-04T12:45:40.842Z (about 1 year ago)
- Topics: prefab, wavemaker-prefab
- Homepage:
- Size: 16.5 MB
- Stars: 1
- Watchers: 6
- Forks: 1
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Extract data from invoices, receipts, PDF documents, screenshots of documents
There are many situations where data has to be extracted from a scanned recipt or a PDF document or an screenshot taken from a mobile phone. Automatically extracting information from such documents will enable in building delighful user experiences. This prefab will help implement such extraction of data any structured document. Given the document as input, the prefab uploads to it to [AWS Textract](https://docs.aws.amazon.com/textract/latest/dg/what-is.html) to extract data from it. The extracted data is available as key-value pair and developer can pick the data they are interested in.