https://github.com/oshekharo/image2text-pro
An advanced, configurable OCR tool for extracting text from images with preprocessing and parallel processing capabilities. Optimized for Chinese text but supports multiple languages.
mtl ocr opencv python tesseract-ocr
- Host: GitHub
- URL: https://github.com/oshekharo/image2text-pro
- Owner: OshekharO
- Created: 2023-02-16T17:24:37.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2025-06-24T05:50:25.000Z (12 months ago)
- Last Synced: 2025-06-24T06:33:58.080Z (12 months ago)
- Topics: mtl, ocr, opencv, python, tesseract-ocr
- Language: Python
- Homepage:
- Size: 21.5 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Image2Text-Pro
Advanced OCR tool for extracting text from images with preprocessing and parallel processing.



## Features ✨
- 📷 Supports multiple image formats (JPG, PNG, TIFF, BMP)
- 🔍 Advanced image preprocessing for better OCR accuracy
- ⚡ Parallel processing for fast batch operations
- 🌍 Multi-language support (Chinese by default)
- 📊 Progress tracking and performance metrics
- 🛠️ Configurable preprocessing and OCR parameters
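The batch behaviour described in the list above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `find_images`, `ocr_image`, and `extract_batch` are hypothetical, the grayscale plus Otsu threshold step is only an assumed stand-in for the tool's preprocessing, and a thread pool is an assumed choice (pytesseract runs the `tesseract` binary as a subprocess, so threads can overlap the work).

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, Dict, List

# Extensions matching the formats listed above (JPG, PNG, TIFF, BMP).
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".tif", ".tiff", ".bmp"}


def find_images(input_dir: str) -> List[Path]:
    """Return the supported image files in input_dir, sorted by path."""
    return sorted(
        p for p in Path(input_dir).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


def ocr_image(path: Path, lang: str = "chi_sim") -> str:
    """Hypothetical OCR step: grayscale, Otsu threshold, then Tesseract."""
    # Imported lazily so the batch logic can be tried without OpenCV installed.
    import cv2
    import pytesseract

    image = cv2.imread(str(path))
    if image is None:
        raise ValueError(f"Could not read image: {path}")
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Assumed preprocessing: binarise with an automatically chosen threshold.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return pytesseract.image_to_string(binary, lang=lang)


def extract_batch(
    input_dir: str,
    output_dir: str,
    ocr_fn: Callable[[Path], str] = ocr_image,
    workers: int = 4,
) -> Dict[str, str]:
    """OCR every image in input_dir in parallel; write one .txt file per image."""
    images = find_images(input_dir)
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    results: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so each text pairs with its image.
        for path, text in zip(images, pool.map(ocr_fn, images)):
            (out / f"{path.stem}.txt").write_text(text, encoding="utf-8")
            results[path.name] = text
    return results
```

Passing the OCR step in as `ocr_fn` keeps the file discovery and parallel loop independent of OpenCV and Tesseract, so a different preprocessing routine can be swapped in without touching the batch code.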
## Installation 🛠️
1. Install Tesseract OCR:
```bash
# On Ubuntu/Debian
sudo apt install tesseract-ocr
sudo apt install libtesseract-dev
# On macOS
brew install tesseract
```
2. Python Dependencies:
```bash
pip install -r requirements.txt
```
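After installing, it can help to confirm that Tesseract is on the `PATH` and that the language data you need (for example `chi_sim`) is present. The sketch below is an assumption-laden helper, not part of the repository: `parse_langs` and `installed_languages` are hypothetical names, and the parsing assumes the usual `tesseract --list-langs` output of one header line followed by one language code per line.

```python
import shutil
import subprocess
from typing import List


def parse_langs(output: str) -> List[str]:
    """Extract language codes from `tesseract --list-langs` output."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    # Assumed format: a "List of available languages ..." header, then codes.
    return [line for line in lines if not line.startswith("List of")]


def installed_languages() -> List[str]:
    """Return the language codes reported by the local Tesseract install."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on PATH")
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=True,
    )
    # Some Tesseract versions print the list on stderr rather than stdout.
    return parse_langs("\n".join([result.stdout, result.stderr]))
```

If `chi_sim` is missing from the result, the language data is usually installed separately, for example via the `tesseract-ocr-chi-sim` package on Debian/Ubuntu or `tesseract-lang` with Homebrew.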
## Usage 🚀
1. Basic Command:
```bash
python text_extractor.py -i input_images -o output_texts
```
2. Advanced Usage:
```bash
python text_extractor.py \
  -i ./photos \
  -o ./extracted_texts \
  --lang eng+chi_sim \
  --psm 11 \
  --workers 8
```
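To make the flags above more concrete, here is a rough sketch of how such options could be parsed and handed to Tesseract. It is not taken from `text_extractor.py`: the long option names `--input` and `--output`, all default values, and the helper names `build_parser` and `tesseract_options` are assumptions. What is standard Tesseract behaviour is that `--lang` values can be joined with `+` (as in `eng+chi_sim`) and that `--psm` takes a page segmentation mode from 0 to 13, where 11 means sparse text.

```python
import argparse
from typing import Dict


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags shown in the usage examples."""
    parser = argparse.ArgumentParser(description="Extract text from images (sketch).")
    # The short flags come from the examples; long names and defaults are assumed.
    parser.add_argument("-i", "--input", required=True, help="directory of input images")
    parser.add_argument("-o", "--output", required=True, help="directory for text files")
    parser.add_argument("--lang", default="chi_sim", help="Tesseract language(s), e.g. eng+chi_sim")
    parser.add_argument("--psm", type=int, default=3, choices=range(0, 14),
                        help="Tesseract page segmentation mode (0-13)")
    parser.add_argument("--workers", type=int, default=4, help="number of parallel workers")
    return parser


def tesseract_options(args: argparse.Namespace) -> Dict[str, str]:
    """Translate parsed flags into keyword arguments for pytesseract.image_to_string."""
    return {"lang": args.lang, "config": f"--psm {args.psm}"}


# The advanced example from above, passed as an explicit argument list.
args = build_parser().parse_args(
    ["-i", "./photos", "-o", "./extracted_texts",
     "--lang", "eng+chi_sim", "--psm", "11", "--workers", "8"]
)
print(tesseract_options(args))
```

The resulting dictionary could be passed on as `pytesseract.image_to_string(image, **tesseract_options(args))`, with `args.workers` controlling the size of whatever worker pool the batch step uses.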
## Contributing 🤝
We welcome contributions! Please:
1. Fork the repository
2. Create a feature branch (`git checkout -b feature/your-feature`)
3. Commit your changes (`git commit -m 'Add some feature'`)
4. Push to the branch (`git push origin feature/your-feature`)
5. Open a Pull Request
Made with ❤️ and Python
Note: OCR accuracy may vary depending on image quality and language complexity.