
https://github.com/adilsezer/pdftextextractor


# PdfTextExtractor
PdfTextExtractor is a Windows-based VB.NET WinForms application that lets users specify a directory to watch for new PDF files and automatically extracts their text using Tesseract OCR.

## Basic Features
* Select a folder to monitor for new PDF files
* Start and stop the folder watcher
* Extract text from scanned or digital documents
* Create a .txt file containing the extracted text
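
The workflow behind these features (watch a folder, pick up new PDFs, write the extracted text to a file) can be illustrated with a short sketch. This is not the application's VB.NET code: it is a minimal Python illustration that polls the folder instead of relying on file system events. The function names, the polling interval, the `extract_text` callable that stands in for the Tesseract OCR step, and the choice to save the `.txt` file next to the PDF are all assumptions made for the example.

```python
import threading
from pathlib import Path
from typing import Callable, List, Set


def find_new_pdfs(folder: Path, seen: Set[Path]) -> List[Path]:
    """Return PDF files in `folder` not seen before, and mark them as seen."""
    new_files = sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf" and p not in seen
    )
    seen.update(new_files)
    return new_files


def write_text_file(pdf_path: Path, extract_text: Callable[[Path], str]) -> Path:
    """Run the extractor on one PDF and save the result as a .txt file.

    Assumption: the output is written next to the PDF with the same base name.
    """
    txt_path = pdf_path.with_suffix(".txt")
    txt_path.write_text(extract_text(pdf_path), encoding="utf-8")
    return txt_path


def process_folder_once(
    folder: Path, seen: Set[Path], extract_text: Callable[[Path], str]
) -> List[Path]:
    """Handle every PDF that appeared since the previous call."""
    return [write_text_file(p, extract_text) for p in find_new_pdfs(folder, seen)]


def watch(
    folder: Path,
    extract_text: Callable[[Path], str],
    stop_event: threading.Event,
    interval: float = 2.0,  # hypothetical polling interval in seconds
) -> None:
    """Poll `folder` until `stop_event` is set (the start/stop control)."""
    seen: Set[Path] = set()
    while not stop_event.is_set():
        process_folder_once(folder, seen, extract_text)
        stop_event.wait(interval)
```

In this sketch, starting the watcher means running `watch` in a background thread, and stopping it means calling `stop_event.set()`. The OCR step is passed in as a callable so that the folder-handling logic can be tried out without Tesseract installed.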

## Requirements
* .NET Framework 4.8

## Screenshot
Main Application GUI

![PdfTextExtractor main application GUI](https://github.com/sezerad/PdfTextExtractor/blob/main/PdfTextExtractor/Screenshots/PdfTextExtractorGUI.PNG?raw=true "PdfTextExtractor")