https://github.com/roshniram/printed_text_recognition_and_conversion


Host: GitHub
URL: https://github.com/roshniram/printed_text_recognition_and_conversion
Owner: roshniRam
License: mit
Created: 2019-03-10T17:58:22.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-03-10T18:04:03.000Z (over 7 years ago)
Last Synced: 2025-07-08T05:44:19.974Z (12 months ago)
Topics: cnn, image-processing, ml, opencv, python3
Language: Python
Homepage:
Size: 35.9 MB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE


# Printed-Text-recognition-and-conversion

[![forthebadge](https://forthebadge.com/images/badges/made-with-python.svg)](https://forthebadge.com)

[![forthebadge](https://forthebadge.com/images/badges/built-with-love.svg)](https://forthebadge.com)

## Introduction

There is a growing demand for storing the information held in paper documents on a computer or storage disk and later retrieving it by searching. One simple way to get this information into a computer system is to scan the documents and store them as images. Reusing the information is then difficult, however: the contents have to be read and searched line by line and word by word, because an image is neither searchable nor editable. Even when scanned images are converted directly to PDF, the result is still not editable or searchable.

The aim of this project was to build software capable of identifying and recognizing typed English text in an image (.jpg, .jpeg, .png) and converting it to an editable format (.txt, etc.), so that the text can be modified directly without retyping the document manually. The project involves the implementation of image processing techniques and machine learning algorithms.

## Approach:

 * Image Processing:
   * Binarization
   * Skew-Correction
 * Segmentation:
   * Line segmentation
   * Character segmentation
 * The training is done using a CNN model.
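
The binarization and segmentation steps listed above can be illustrated with a minimal NumPy sketch. This is not code from the repository: the function names (`binarize`, `runs`, `segment_lines`, `segment_characters`), the fixed threshold of 128, and the projection-profile method are assumptions chosen for illustration. It also assumes dark text on a light background and a page that has already been skew-corrected. Skew correction and the CNN classifier are left out.

```python
import numpy as np


def binarize(gray, threshold=128):
    """Return a boolean mask where True marks ink (dark) pixels.

    A fixed global threshold is an assumption made for this sketch;
    Otsu's method is a common alternative.
    """
    return np.asarray(gray) < threshold


def runs(profile):
    """Return (start, end) pairs, end exclusive, for each run of non-zero values."""
    on = np.concatenate(([False], np.asarray(profile) > 0, [False]))
    # Indices where the on/off state flips: even entries are starts, odd are ends.
    edges = np.flatnonzero(on[1:] != on[:-1])
    return [(int(s), int(e)) for s, e in zip(edges[::2], edges[1::2])]


def segment_lines(ink):
    """Split the page into text lines using the horizontal projection profile."""
    return runs(ink.sum(axis=1))


def segment_characters(ink, line):
    """Split one text line into characters using the vertical projection profile."""
    top, bottom = line
    return runs(ink[top:bottom].sum(axis=0))


# Synthetic 10x10 "page": white background (255) with three black blobs (0).
page = np.full((10, 10), 255, dtype=np.uint8)
page[1:4, 1:3] = 0   # first line, first glyph
page[1:4, 5:8] = 0   # first line, second glyph
page[6:9, 2:6] = 0   # second line, single glyph

ink = binarize(page)
lines = segment_lines(ink)
for line in lines:
    print(line, segment_characters(ink, line))
```

Each `(start, end)` pair is a row range for a line or a column range for a character, so a glyph can be cropped as `ink[top:bottom, left:right]` before being passed to a classifier. Projection profiles only separate glyphs that have at least one blank column between them, so touching characters would need a different method.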

## To install

The project is written in Python 3.

  

### Required libraries

```sh
NumPy
OpenCV
Sklearn
Scikit
TensorFlow
PyQt4
```

### To run through GUI

```sh
python gui.py
```

### To run on CLI

```sh
python main.py
```

## Authors

[Roshni Ram](https://github.com/roshniRam): [Twitter](https://twitter.com/RoshniRam1), [LinkedIn](https://www.linkedin.com/in/roshni-ram-306b0a164/), [Facebook](https://www.facebook.com/roshni.ram.7), [Instagram](https://www.instagram.com/roshniram11/)

[Ishita Das](https://github.com/ishita27): [Twitter](https://twitter.com/ishitaaaa_), [LinkedIn](https://www.linkedin.com/in/ishita-das-22b457171/), [Facebook](https://www.facebook.com/ishita.das.792), [Instagram](https://www.instagram.com/ishitadas27/)

[Rohit Shamdasani](https://github.com/Rohit280297): [Twitter](https://twitter.com/shamdasanirohit), [LinkedIn](https://www.linkedin.com/in/rohit-shamdasani-69999b165/), [Facebook](https://www.facebook.com/rohit.shamdasani.5), [Instagram](https://www.instagram.com/shamdasanirohit/)

Ayush Mudgal: [LinkedIn](https://www.linkedin.com/in/ayush-mudgal-ba71a9150/), [Facebook](https://www.facebook.com/ayushmudgal11), [Instagram](https://www.instagram.com/ayush_mudgal11/)
