Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dilippuri/pan-card-ocr
Retrive meaningful information from PAN Card image using tesseract-ocr :sunglasses:
https://github.com/dilippuri/pan-card-ocr
aadhar ocr optical-character-recognition pan-card
Last synced: 6 days ago
JSON representation
Retrive meaningful information from PAN Card image using tesseract-ocr :sunglasses:
- Host: GitHub
- URL: https://github.com/dilippuri/pan-card-ocr
- Owner: dilippuri
- License: mit
- Created: 2016-06-20T12:27:17.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2024-08-29T04:36:04.000Z (4 months ago)
- Last Synced: 2024-12-21T10:13:43.251Z (13 days ago)
- Topics: aadhar, ocr, optical-character-recognition, pan-card
- Language: Python
- Homepage:
- Size: 110 KB
- Stars: 123
- Watchers: 12
- Forks: 60
- Open Issues: 10
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
:sparkles: This document for OCR :sparkles:
![PAN Card to JSON](PANOcr1.jpg?raw=true "PAN Card image")
*****************************************************
Problem:
*****************************************************
Extract information from image of Personal Account Number(PAN) Card
by OCR in proper format[Standard according Indian Govt.].
Information like -
Name, Father's Name, Date of Birth, PAN
**********************************************************************************************************
Solution:
*****************************************************
Steps:
-> Take image
-> crop to box(which has text in it)
-> convert into gray scale(mono crome)
-> give to tesseract
-> text(output of tesseract)
Now we will process this text means we will get meaningful information from it.
-> find name using name database
-> find father's name(assuming that second will be father's name)
-> find year of birth
-> find for PAN
*****************************************************
*****************************************************
Dependent packages
*****************************************************
-python
-opencv
-numpy
-pytesseract
-JSON
-difflib
-csv
-PIL
-SciPy
-dataparser
**********************************************************************************************************
Structure and Usage
*****************************************************
Directories:
src-
which contains code files
testcases-
which contains testing images
result
it contains JSON object
Usage:
python file_name.py [input image]
Output will be JSON object name
*****************************************************
:100: