An open API service indexing awesome lists of open source software.

https://github.com/drisskhattabi6/pytesseract-ocr-for-image-and-pdfs

This Repo contains implementation of OCR for Image and PDFs Using Pytesseract and OpenCV
https://github.com/drisskhattabi6/pytesseract-ocr-for-image-and-pdfs

Last synced: 3 months ago
JSON representation

This Repo contains implementation of OCR for Image and PDFs Using Pytesseract and OpenCV

Awesome Lists containing this project

README

          

# ๐Ÿงพ Pytesseract OCR for Images and PDFs

This repository demonstrates how to extract text from **images** and **PDF documents** using **Pytesseract** โ€” a Python wrapper for Google's Tesseract-OCR Engine.

## ๐Ÿ“Œ Overview

The project showcases:

* How to apply OCR to **images** (JPEG, PNG, etc.)
* How to convert **PDFs** to images and extract text using OCR
* Code examples implemented in Jupyter Notebooks

## Example

Source text :

![](imgs/easy_text.png)

Detected Image :

![](imgs/text_with_boxes.jpg)