Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/moha-cm/bizcardx
BizCardX: Extracting Business Card Data with OCR
https://github.com/moha-cm/bizcardx
data-extraction mariadb-database ocr ocr-python sql sqlalchemy streamlit-dashboard
Last synced: 7 days ago
JSON representation
BizCardX: Extracting Business Card Data with OCR
- Host: GitHub
- URL: https://github.com/moha-cm/bizcardx
- Owner: Moha-cm
- Created: 2023-11-11T10:58:25.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-12-02T14:33:31.000Z (12 months ago)
- Last Synced: 2023-12-03T13:28:17.303Z (12 months ago)
- Topics: data-extraction, mariadb-database, ocr, ocr-python, sql, sqlalchemy, streamlit-dashboard
- Language: Python
- Homepage:
- Size: 805 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Business Card OCR Application
## **Overview**
The task is to develop a Streamlit application that facilitates the extraction of relevant information from business cards using easyOCR. The extracted information includes the company name, card holder name, designation, mobile number, email address, website URL, area, city, state, and pin code. Additionally, the application should allow users to save this information, along with the uploaded business card image, into a database. The database should support multiple entries, each with its own business card image and extracted information.
## 1. Install Required Packages
Make sure to have the following packages installed:
```bash
# Install Python
install python
```# Required Python Packages
```
pip install pandas
pip install streamlit
pip install sqlalchemy
pip install PyMySQL
pip install numpy
pip install streamlit-option-menu
pip install Pillow
```# Install PyTorch and easyocr
```
install pytorch https://pytorch.org/
pip install easyocr
```## 2. Design User Interface
Create a user-friendly interface using Streamlit, incorporating widgets like file uploader, buttons, and text boxes to guide users through the process.## 3. Implement Image Processing and OCR
Utilize easyOCR for extracting relevant information from the business card image.## 4. Display Extracted Information
Present the extracted information in an organized manner within the Streamlit GUI, using widgets like tables, text boxes, and labels.## 5. Implement Database Integration
Integrate a database management system (SQLite or MySQL) to store extracted information and associated business card images. Utilize SQL queries for creating tables, inserting, updating, and retrieving data.## Run the application using the following command
```
streamlit run ./home.py
```
This will launch the Streamlit application, allowing you to upload business card images, extract information, and store it in the integrated database