https://github.com/Reiqy/document-scanner

Document Scanner application in Python
https://github.com/Reiqy/document-scanner

computer-vision python scanner

Last synced: 5 months ago
JSON representation

Document Scanner application in Python

Host: GitHub
URL: https://github.com/Reiqy/document-scanner
Owner: Reiqy
License: mit
Created: 2021-07-04T19:05:40.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2021-07-06T15:39:18.000Z (over 4 years ago)
Last Synced: 2024-08-05T09:16:17.138Z (over 1 year ago)
Topics: computer-vision, python, scanner
Language: Python
Homepage:
Size: 3.53 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Document Scanner

Simple CLI application for scanning documents from images. This application is build around algorithm proposed by Adrian Rosebrock [[1]](https://www.pyimagesearch.com/2014/09/01/build-kick-ass-mobile-document-scanner-just-5-minutes/) and uses OpenCV's adaptive thresholding [[2]](https://docs.opencv.org/4.5.2/d7/d4d/tutorial_py_thresholding.html) for scanner-like look of the output images.

![Input Image](readme/1_mini.jpg)
![Outut Image](readme/1_out_mini.jpg)

_Source image on the left is converted into the output image on the right._

## Usage

To use Document Scanner you need to have Python installed with all the required packages as specified in `requirements.txt`. Run

$ pip install -r requirements.txt

Then you can run the script by calling

$ python scanner.py

There are several arguments that have to be specified for successful execution of the program. You need to specify the format of the program output. You can choose between

1. `-v`, this option only displays the output images. Additionaly to this option you can optionally specify
- `-t `, target height of the displayed images on screen;

2. `-p `, this option creates a `.pdf` with specified `` (note that the name must have the `.pdf` extension). If you want you can optionally specify dimensions of the images in the generate file with
- `-d `, defaults for these options are numbers `210` and `297` (dimensions of A4 paper);

3. `-i `, this option generates individual images in directory ``.

For all the options you can specify the `-s ` which will apply post-processing effects to make the document look more like if it was scanned by conventional document scanner. Numbers `` and `` can either be zeros. This will cause the output images to be converted to grayscale. Or they can be positive and be used as arguments to the adaptive thresholding. First number stands for block size, second is constant `c`. Good results are obtained with numbers `11` and `10`.

Then you have to specify at least one `` of input image.

Full call of the script can look like this

$ python scanner.py -p out/test.pdf -s 11 10 data/1.jpg data/2.jpg

Keep in mind that the command line arguments will likely change in future versions.

## Development

This project is still in development. Future goals are

1. Better post-processing of the output images;

2. Plain-text output.

## Reference

[1] [Building Mobile Document Scanner by Adrian Rosebrock](https://www.pyimagesearch.com/2014/09/01/build-kick-ass-mobile-document-scanner-just-5-minutes/)

[2] [Adaptive Thresholding](https://docs.opencv.org/4.5.2/d7/d4d/tutorial_py_thresholding.html)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Reiqy/document-scanner

Awesome Lists containing this project

README