Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/isabelleysseric/ocr-system

OCR System is a Python application that allows you to extract text from PDF documents, uploaded images, or images captured via a webcam. The application uses the Tesseract OCR library to recognize and extract text from images or PDF pages.
https://github.com/isabelleysseric/ocr-system

image-recognition natural-language-processing ocr tesseract-ocr text-detection

Last synced: 8 days ago
JSON representation

Host: GitHub
URL: https://github.com/isabelleysseric/ocr-system
Owner: isabelleysseric
License: mit
Created: 2024-08-27T05:44:00.000Z (3 months ago)
Default Branch: master
Last Pushed: 2024-09-10T20:08:17.000Z (2 months ago)
Last Synced: 2024-09-11T20:12:57.923Z (2 months ago)
Topics: image-recognition, natural-language-processing, ocr, tesseract-ocr, text-detection
Language: Python
Homepage:
Size: 825 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

Optical Character Recognition (OCR) System

![OCR System interface image](data/api/OCR-System.png)

## Author
[Isabelle Eysseric](https://github.com/isabelleysseric)

## Project Description

OCR System is a Python application that allows you to extract text from PDF documents, uploaded images or images captured via a webcam. The application uses the Tesseract OCR library to recognize and extract text from images or PDF pages.

## Features

- **Load PDF**: Loads a PDF file and extracts the text from each page.
- **Load Image**: Loads an image from the file system and extracts the text.
- **Capture Image**: Captures an image via a webcam and extracts the text.
- **Extract Text**: Saves the extracted text to a `.txt` file, including information about the source (PDF, uploaded image, captured image).

## Prerequisites

Before using the application, make sure you have installed the following dependencies:

- Python 3.7+
- Tkinter (included with standard Python)
- OpenCV (`cv2`)
- NumPy
- Pillow (`PIL`)
- Tesseract-OCR
- PyMuPDF (`fitz`)
- PDF2Image
- PyTesseract

### Installing Tesseract-OCR

1. Download and install Tesseract-OCR from [the official website](https://github.com/tesseract-ocr/tesseract).
2. Add Tesseract to your PATH or specify the path to `tesseract.exe` in the Python script.

### Installing Python Dependencies

You can install the required Python dependencies by running:

```bash
pip install -r requirements.txt
```

## Usage

To start the application, run the main script:

```bash
python gui.py
```

### User Interface

* **Load PDF**: Open a PDF file, convert each page to an image, and then extract the text.
* **Load Image**: Load an image from your file system to extract the text.
* **Capture Image**: Use your webcam to capture an image and extract the text.
* **Analyze Image**: Apply preprocessing to improve text recognition.
* **Extract Text**: Extract text from the selected source (PDF, loaded image, or captured image) and save the extracted text to a `.txt` file.

### Saving results

When extracting text, a `.txt` file is generated, including information about the source of the text:

* `captured_image_extracted_text.txt`: Text extracted from an image captured via webcam.
* `loaded_image_extracted_text.txt`: Text extracted from an uploaded image.
* `loaded_pdf_extracted_text.txt`: Text extracted from an uploaded PDF file.

## # OCR System

## Project Description

## Features

## Prerequisites

Before using the application, make sure you have installed the following dependencies:

- Python 3.7+
- Tkinter (included with standard Python)
- OpenCV (`cv2`)
- NumPy
- Pillow (`PIL`)
- Tesseract-OCR
- PyMuPDF (`fitz`)
- PDF2Image
- PyTesseract

### Installing Tesseract-OCR

### Installing Python Dependencies

You can install the required Python dependencies by running:

```bash
pip install -r requirements.txt
```

## Usage

To start the application, run the main script:

```bash
python gui.py
```

### User Interface

### Saving results

When extracting text, a `.txt` file is generated, including information about the source of the text:

## Common issues

### `TesseractNotFoundError`

* Make sure Tesseract-OCR is installed and the path to `tesseract.exe` is set correctly in the Python script.

### No text extraction

* Make sure the uploaded image or PDF contains recognizable text. Try improving the image quality or using a higher resolution.

## Contribution

Contributions are welcome! If you have any ideas for improvement, feel free to submit a pull request or open an issue.

## License

This project is licensed under the MIT License. See the [LICENSE](https://github.com/isabelleysseric/OCR-System/blob/master/LICENSE) file for more information.