Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/Abhradipta/OCR-With-Read-Out-Loud-Using-Python

An Optical Character Recognition (OCR) System designed using Python to read the contents out loud.

computer-vision google-tesseract google-text-to-speech natural-language-processing opencv python tesseract-ocr text-to-speech


Host: GitHub
URL: https://github.com/Abhradipta/OCR-With-Read-Out-Loud-Using-Python
Owner: Abhradipta
License: mit
Created: 2020-08-17T14:00:19.000Z (almost 4 years ago)
Default Branch: master
Last Pushed: 2020-09-09T14:41:42.000Z (almost 4 years ago)
Last Synced: 2024-03-17T00:43:55.905Z (4 months ago)
Topics: computer-vision, google-tesseract, google-text-to-speech, natural-language-processing, opencv, python, tesseract-ocr, text-to-speech
Language: Python
Homepage:
Size: 8.79 KB
Stars: 3
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Lists

awesome-stars - Abhradipta/OCR-With-Read-Out-Loud-Using-Python - An Optical Character Recognition (OCR) System designed using Python to read the contents out loud. (Python)

README

# OCR-With-Read-Out-Loud-Using-Python

This is an Optical Character Recognition (OCR) system written in Python that reads the recognized text out loud.

The system takes an image file as input, runs optical character recognition on it with the Google Tesseract OCR engine, and extracts the text present in the image. It saves the extracted text to a text file, converts the contents of that file to an audio file using Google Text To Speech, and then plays the audio file.
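The steps above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the function names `output_paths` and `read_image_aloud`, the output file naming, and the BGR-to-RGB conversion step are assumptions made for the example. It also assumes the Tesseract binary is installed and that gTTS can reach the network.

```python
from pathlib import Path


def output_paths(image_path):
    """Derive text and audio file paths next to the image (hypothetical naming)."""
    image = Path(image_path)
    return image.with_suffix(".txt"), image.with_suffix(".mp3")


def read_image_aloud(image_path, lang="en"):
    """Run OCR on an image, save the text, convert it to speech, and play it."""
    # Third-party imports are local so output_paths() works without them.
    import cv2
    import pytesseract
    from gtts import gTTS
    from playsound import playsound

    text_path, audio_path = output_paths(image_path)

    # Load the image; cv2.imread returns None if the file cannot be read.
    image = cv2.imread(str(image_path))
    if image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")

    # OpenCV loads images as BGR; convert to RGB before passing to Tesseract.
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    text = pytesseract.image_to_string(rgb)

    # Save the recognized text to a text file.
    text_path.write_text(text, encoding="utf-8")

    # Convert the contents of the text file to an audio file and play it.
    spoken = text_path.read_text(encoding="utf-8").strip()
    if not spoken:
        raise ValueError("No text was recognized in the image")
    gTTS(text=spoken, lang=lang).save(str(audio_path))
    playsound(str(audio_path))
    return text
```

Calling `read_image_aloud("scan.png")` would, under these assumptions, write `scan.txt` and `scan.mp3` next to the image before playing the audio.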

It is a simple OCR tool with one additional feature: it reads the text found in the image out loud.

Required Modules: The following modules must be imported in the Python 3.8 codebase:

OpenCV 3.2.0 (cv2): This module is used to load the input image.

Pytesseract: This module is used to run the optical character recognition on the image.

Google Text To Speech (gTTS): This module is used to convert the text to speech.

Playsound: This module is used to play the audio.
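Before running the system, it can help to check that the four modules above are importable. The sketch below is not part of the repository: the helper name `missing_modules` and the mapping from import names to pip package names are assumptions for illustration. Note that Pytesseract is a wrapper, so the Tesseract OCR engine itself must also be installed separately.

```python
import importlib.util

# Import name -> pip package name (assumed mapping for this example).
REQUIRED = {
    "cv2": "opencv-python",
    "pytesseract": "pytesseract",
    "gtts": "gTTS",
    "playsound": "playsound",
}


def missing_modules(required=REQUIRED):
    """Return the pip names of required modules that cannot be imported."""
    return [
        pip_name
        for import_name, pip_name in required.items()
        if importlib.util.find_spec(import_name) is None
    ]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules, try: pip install " + " ".join(missing))
    else:
        print("All required modules are available.")
```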