https://github.com/dilippuri/pan-card-ocr

Retrieve meaningful information from a PAN Card image using tesseract-ocr :sunglasses:

aadhar ocr optical-character-recognition pan-card


:sparkles: This document describes OCR for PAN Cards :sparkles:

![PAN Card to JSON](PANOcr1.jpg?raw=true "PAN Card image")

*****************************************************
Problem:
*****************************************************
Extract information from an image of a Permanent Account Number (PAN) Card
using OCR, in the proper format (the standard set by the Indian Govt.).
Information such as:
Name, Father's Name, Date of Birth, PAN
*****************************************************

*****************************************************
Solution:
*****************************************************
Steps:
-> take the image
-> crop to the box that contains the text
-> convert to grayscale (monochrome)
-> pass it to tesseract
-> text (output of tesseract)
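
A minimal sketch of the steps above, not the code in src. It assumes
OpenCV (cv2) and pytesseract are installed and that the Tesseract binary is
on the PATH. The names clamp_box and ocr_pan_card and the (x, y, w, h) box
layout are hypothetical and chosen only for illustration.

```python
def clamp_box(box, width, height):
    """Clip an (x, y, w, h) crop box so it stays inside the image."""
    x, y, w, h = box
    x0, y0 = max(0, x), max(0, y)
    x1, y1 = min(width, x + w), min(height, y + h)
    if x1 <= x0 or y1 <= y0:
        raise ValueError("crop box lies outside the image")
    return x0, y0, x1, y1


def ocr_pan_card(image_path, box=None):
    """Crop, convert to grayscale, and run Tesseract on a PAN Card image."""
    # Imported lazily so clamp_box stays usable without OpenCV installed.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)  # returns None if the file cannot be read
    if image is None:
        raise FileNotFoundError(f"could not read image: {image_path}")
    if box is not None:
        height, width = image.shape[:2]
        x0, y0, x1, y1 = clamp_box(box, width, height)
        image = image[y0:y1, x0:x1]  # NumPy slicing: rows first, then columns
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    return pytesseract.image_to_string(gray)
```

Calling ocr_pan_card("testcases/card.jpg") on a hypothetical test image
returns the raw text, which still has to be parsed.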
Next, we process this text to extract meaningful information from it.
-> find the name using the name database
-> find the father's name (assuming the second name is the father's name)
-> find the year of birth
-> find the PAN
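
The text-processing steps above could be sketched as follows. This is an
illustration under assumptions, not the project's code: the name database is
taken to be a plain list of upper-case first names, the father's name is taken
to be the line right after the name, and the date is assumed to be printed as
DD/MM/YYYY. The PAN pattern (five letters, four digits, one letter) is the
standard PAN layout. parse_pan_text and its return keys are hypothetical.

```python
import difflib
import re

# Standard PAN layout: five letters, four digits, one letter.
PAN_PATTERN = re.compile(r"\b[A-Z]{5}[0-9]{4}[A-Z]\b")
# Assumed date layout DD/MM/YYYY (or with dashes); group 1 is the year.
DATE_PATTERN = re.compile(r"\b\d{2}[/-]\d{2}[/-]((?:19|20)\d{2})\b")


def parse_pan_text(text, known_names, cutoff=0.8):
    """Pull name, father's name, year of birth and PAN out of OCR text."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]

    # A line counts as the name if any word fuzzily matches the database.
    name_idx = None
    for i, line in enumerate(lines):
        words = line.upper().split()
        if any(difflib.get_close_matches(w, known_names, n=1, cutoff=cutoff)
               for w in words):
            name_idx = i
            break

    name = lines[name_idx] if name_idx is not None else None
    # Assumption: the line following the name holds the father's name.
    has_next = name_idx is not None and name_idx + 1 < len(lines)
    father = lines[name_idx + 1] if has_next else None

    date_match = DATE_PATTERN.search(text)
    pan_match = PAN_PATTERN.search(text)
    return {
        "name": name,
        "father_name": father,
        "year_of_birth": date_match.group(1) if date_match else None,
        "pan": pan_match.group(0) if pan_match else None,
    }
```

Fuzzy matching with difflib tolerates small OCR mistakes in the name, at the
cost of occasional false matches when the cutoff is set too low.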
*****************************************************


*****************************************************
Dependent packages
*****************************************************
-python
-opencv
-numpy
-pytesseract
-JSON
-difflib
-csv
-PIL
-SciPy
-dataparser
*****************************************************

*****************************************************
Structure and Usage
*****************************************************
Directories:
src - contains the code files
testcases - contains the test images
result - contains the JSON output

Usage:
python file_name.py [input image]
The output will be a JSON object.
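
As a rough sketch of how the extracted fields might be written out as JSON:
the function name save_result, the record keys, and the choice to name the
file after the input image are all assumptions made for illustration, not
the behaviour of the scripts in src.

```python
import json
from pathlib import Path


def save_result(record, image_path, result_dir="result"):
    """Write the extracted fields as JSON, named after the input image."""
    out_dir = Path(result_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical naming scheme: card1.jpg -> result/card1.json
    out_path = out_dir / (Path(image_path).stem + ".json")
    out_path.write_text(
        json.dumps(record, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return out_path
```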

*****************************************************
:100: