https://github.com/amadeusssystem/fieldnotesdigitizer



# PDF to Transparent Images Converter

This Python script converts each page of a PDF document into an image, removes the ink (black text), and makes the background transparent. The processed pages are saved as PNG files, which preserve the transparency.

## Results
- Before
- ![image](https://github.com/user-attachments/assets/28c78fd0-afce-452f-b8a2-a7c3125761f1)
***
- After
- ![image](https://github.com/user-attachments/assets/6e10ff03-f4c9-43c3-8cb9-48ebf334eb86)
- ![image](https://github.com/user-attachments/assets/e998c1f0-5491-4c45-96c0-2733bff1251c)
- As a student, I find this works fine, and most importantly it is automated.

## Features
- Convert PDF pages to images.
- Remove black ink (text) and make the background transparent.
- Save the processed images as PNG files.
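
The transparency step above can be sketched as a brightness threshold on each pixel. This is only an illustration, not the script's actual code: the function names, the `threshold` value of 128, and the `clear` parameter are hypothetical, and the README does not say how the script itself decides which pixels become transparent. The sketch works on plain RGB tuples so it runs without any imaging library.

```python
from typing import Iterable, List, Tuple

RGB = Tuple[int, int, int]
RGBA = Tuple[int, int, int, int]


def luminance(pixel: RGB) -> float:
    """Approximate perceived brightness (0-255) using Rec. 601 weights."""
    r, g, b = pixel
    return 0.299 * r + 0.587 * g + 0.114 * b


def clear_pixels(pixels: Iterable[RGB], threshold: int = 128,
                 clear: str = "light") -> List[RGBA]:
    """Return RGBA pixels with alpha 0 on one side of the threshold.

    clear="light" makes bright (paper) pixels transparent;
    clear="dark" makes dark (ink) pixels transparent instead.
    Both the threshold and the parameter are assumptions for illustration.
    """
    if clear not in ("light", "dark"):
        raise ValueError("clear must be 'light' or 'dark'")
    out: List[RGBA] = []
    for r, g, b in pixels:
        is_dark = luminance((r, g, b)) < threshold
        make_clear = is_dark if clear == "dark" else not is_dark
        out.append((r, g, b, 0 if make_clear else 255))
    return out


# One white paper pixel and one near-black ink pixel.
sample = [(255, 255, 255), (20, 20, 20)]
print(clear_pixels(sample))                # paper pixel becomes transparent
print(clear_pixels(sample, clear="dark"))  # ink pixel becomes transparent
```

A real page would first be rendered to RGBA pixel data by an imaging library, and the best threshold likely depends on the scan, which is why it is left as a parameter here.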

## Requirements

Make sure you have the following:
- A PDF that has already been scanned. A mobile scanner application also works.
- After scanning, the ink should be some shade of black. I haven't tried this with a physical scanner, so I don't know how effective it is on those scans.
- The dependencies listed in `requirements.txt`.

You can install these dependencies using pip by running:

```bash
pip install -r requirements.txt
```
***
## Development
- Feel free to open a pull request if you find a better configuration that improves the output.