https://github.com/komed3/img2txt

Precise text extraction from images and PDF documents
https://github.com/komed3/img2txt

image-processing ocr pdf-processing tesseract-ocr text-extraction

Last synced: about 2 months ago
JSON representation

Precise text extraction from images and PDF documents

Host: GitHub
URL: https://github.com/komed3/img2txt
Owner: komed3
License: mit
Created: 2026-03-26T08:42:02.000Z (4 months ago)
Default Branch: master
Last Pushed: 2026-03-26T17:44:13.000Z (4 months ago)
Last Synced: 2026-03-27T03:47:49.663Z (4 months ago)
Topics: image-processing, ocr, pdf-processing, tesseract-ocr, text-extraction
Language: JavaScript
Homepage: https://komed3.github.io/img2txt/
Size: 412 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# img2txt

Free web-based tool for precise text extraction from images and PDF documents relying on local OCR processing via Tesseract WebAssembly.

## Features

- **Precise text extraction**: Uses [Tesseract WebAssembly](https://github.com/tesseract-ocr) for accurate OCR processing.
- **Local processing**: All OCR processing happens locally in the browser, ensuring data privacy.
- **Image and PDF support**: Supports both image and PDF documents.
- **Interactive region selection**: Allows users to select specific regions for text extraction.
- **Zoom and pan**: Supports zooming and panning for better region selection.
- **Rotation**: Supports rotating images and PDFs for better OCR results.
- **Multi-language support**: Supports multiple languages for OCR processing.

## Usage

1. Upload an image or PDF document.
2. Select the regions for text extraction. Will be processed in the order they are selected.
3. Click the "Extract Text" button to extract text from the selected regions.
4. The extracted text will be formatted and displayed, ready to be copied to the clipboard.

## License

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/komed3/img2txt

Awesome Lists containing this project

README