An open API service indexing awesome lists of open source software.

https://github.com/utkarsh251106/handwritten-text-recognition-and-processing

Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.
https://github.com/utkarsh251106/handwritten-text-recognition-and-processing

computer-vision image-processing machine-learning text-extraction-from-image vlm

Last synced: about 1 year ago
JSON representation

Utilizes PyTesseract for OCR and a pre-trained VLM (Hugging Face Transformers) to extract and summarize text from images.

Awesome Lists containing this project

README

          

## Description
This project combines Optical Character Recognition (OCR) with a Vision-Language Model (VLM) to extract text from images and generate concise summaries.

# How to run it?
### Step 1: Clone the Repository:

```bash
git clone https://github.com/Utkarsh251106/Handwritten-Text-Recognition-and-Processing.git
```
### Step 2: Create a conda environment:

```bash
conda create -n venv python=3.12.7 -y
conda activate venv
```

### Step 3: Install the requirements:

```bash
pip install -r requirements.txt
```
### Step 4: To run the code:
To run the code
```bash
# Start the Jupyter Notebook environment using the command
jupyter notebook
```
#### Run your Code.ipynb file