https://github.com/utkarsh251106/handwritten-text-recognition-and-processing

Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.

computer-vision image-processing machine-learning text-extraction-from-image vlm

Last synced: about 1 year ago

Host: GitHub
URL: https://github.com/utkarsh251106/handwritten-text-recognition-and-processing
Owner: Utkarsh251106
Created: 2025-03-22T18:11:33.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-03-22T18:59:45.000Z (over 1 year ago)
Last Synced: 2025-03-22T19:26:26.323Z (over 1 year ago)
Topics: computer-vision, image-processing, machine-learning, text-extraction-from-image, vlm
Language: Jupyter Notebook
Homepage:
Size: 190 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

README

## Description
This project combines Optical Character Recognition (OCR) with a Vision-Language Model (VLM) to extract text from images and generate concise summaries.
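The two stages described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the README does not name the VLM, so a Hugging Face text `summarization` pipeline stands in for that step (and assumes a Transformers version that ships this task), and the helper names `clean_ocr_text`, `extract_text`, and `summarize` are hypothetical. The OCR step assumes `pytesseract` and Pillow are installed.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Collapse runs of whitespace and drop the blank lines OCR output often contains."""
    lines = (re.sub(r"\s+", " ", line).strip() for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def extract_text(image_path: str) -> str:
    """Run Tesseract OCR on an image file and return the cleaned text."""
    # Imported lazily so clean_ocr_text stays usable without the OCR stack.
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as image:
        return clean_ocr_text(pytesseract.image_to_string(image))


def summarize(text: str, summarizer=None) -> str:
    """Summarize text; `summarizer` is any callable returning [{"summary_text": ...}]."""
    if summarizer is None:
        # Assumption: a text summarization pipeline stands in for the VLM step,
        # because the README does not say which model the notebook loads.
        from transformers import pipeline

        summarizer = pipeline("summarization")
    return summarizer(text)[0]["summary_text"]
```

For a hypothetical image `note.png`, calling `summarize(extract_text("note.png"))` would run both stages in order. Passing the summarizer in as a callable keeps the OCR step independent of whichever model is eventually used.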

## How to run it
### Step 1: Clone the Repository:

```bash
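git clone https://github.com/Utkarsh251106/Handwritten-Text-Recognition-and-Processing.git
# Move into the cloned repository so later commands can find requirements.txt
cd Handwritten-Text-Recognition-and-Processing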
```
### Step 2: Create a conda environment:

```bash
conda create -n venv python=3.12.7 -y
conda activate venv
```

### Step 3: Install the requirements:

```bash
pip install -r requirements.txt
```
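After installing, a quick check like the one below may help confirm the environment is usable. It is only a sketch: the module list is an assumption based on the project description (the contents of `requirements.txt` are not shown here), and `missing_dependencies` is a hypothetical helper. Note that `pytesseract` is a wrapper around the Tesseract OCR engine, which `pip` does not install, so the `tesseract` executable has to be available on `PATH` separately.

```python
import importlib.util
import shutil


def missing_dependencies(modules=("pytesseract", "PIL", "transformers"),
                         binary="tesseract"):
    """Return a list of human-readable problems; the list is empty if all is found."""
    # Assumed module names; adjust them to match requirements.txt.
    problems = [
        f"Python package not importable: {name}"
        for name in modules
        if importlib.util.find_spec(name) is None
    ]
    # pytesseract shells out to the Tesseract executable, so check PATH too.
    if shutil.which(binary) is None:
        problems.append(f"Executable not found on PATH: {binary}")
    return problems
```

Running `print(missing_dependencies())` inside the activated conda environment lists anything that still needs to be installed.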
### Step 4: Run the code:
```bash
# Start the Jupyter Notebook environment using the command
jupyter notebook
```
#### Open the `Code.ipynb` file in Jupyter and run its cells
