https://github.com/utkarsh251106/handwritten-text-recognition-and-processing
Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.
https://github.com/utkarsh251106/handwritten-text-recognition-and-processing
computer-vision image-processing machine-learning text-extraction-from-image vlm
Last synced: about 1 year ago
JSON representation
Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.
- Host: GitHub
- URL: https://github.com/utkarsh251106/handwritten-text-recognition-and-processing
- Owner: Utkarsh251106
- Created: 2025-03-22T18:11:33.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-22T18:59:45.000Z (over 1 year ago)
- Last Synced: 2025-03-22T19:26:26.323Z (over 1 year ago)
- Topics: computer-vision, image-processing, machine-learning, text-extraction-from-image, vlm
- Language: Jupyter Notebook
- Homepage:
- Size: 190 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Description
This project combines Optical Character Recognition (OCR) with a Vision-Language Model (VLM) to extract text from images and generate concise summaries.
# How to run it?
### Step 1: Clone the Repository:
```bash
git clone https://github.com/Utkarsh251106/Handwritten-Text-Recognition-and-Processing.git
```
### Step 2: Create a conda environment:
```bash
conda create -n venv python=3.12.7 -y
conda activate venv
```
### Step 3: Install the requirements:
```bash
pip install -r requirements.txt
```
### Step 4: To run the code:
To run the code
```bash
# Start the Jupyter Notebook environment using the command
jupyter notebook
```
#### Run your Code.ipynb file