https://github.com/lamouchi-bayrem/document_scanner
flask web app that scans documents using OpenCV
https://github.com/lamouchi-bayrem/document_scanner
ajax document extract-data flask ia ocr-recognition scanning-tool sql-server tailwindcss
Last synced: 8 months ago
JSON representation
flask web app that scans documents using OpenCV
- Host: GitHub
- URL: https://github.com/lamouchi-bayrem/document_scanner
- Owner: Lamouchi-Bayrem
- Created: 2025-05-06T16:08:12.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2025-05-06T19:04:30.000Z (8 months ago)
- Last Synced: 2025-05-12T01:55:08.923Z (8 months ago)
- Topics: ajax, document, extract-data, flask, ia, ocr-recognition, scanning-tool, sql-server, tailwindcss
- Language: Python
- Homepage:
- Size: 4.09 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# OCR Text Extractor 🔍🖼️
A sleek, responsive web application that allows users to extract text from images and PDF files instantly using Optical Character Recognition (OCR). Built with Bootstrap, Animate.css, and a clean UI, this project offers fast, secure, and accurate OCR capabilities for a seamless user experience.
## 🌐 Live Demo
Coming soon...
## 📸 Features
- 📤 **Drag & Drop Upload**: Easily upload images or PDFs.
- ⚡ **Fast Processing**: OCR runs instantly upon upload.
- ✅ **High Accuracy**: Extracts text reliably from various formats.
- 🔐 **Privacy First**: Files are processed securely, without permanent storage.
- 📝 **Text Editing**: Review and edit extracted content.
- 📋 **Copy or Download**: Export your text or copy it to clipboard.
## 🛠️ Technologies Used
- **Frontend**:
- HTML5 / CSS3
- [Bootstrap 5](https://getbootstrap.com/)
- [Bootstrap Icons](https://icons.getbootstrap.com/)
- [Animate.css](https://animate.style/)
- [Google Fonts (Poppins)](https://fonts.google.com/specimen/Poppins)
- **Backend (Not included here)**:
- Can be integrated with Flask, FastAPI, Node.js, etc.
- OCR with [Tesseract OCR](https://github.com/tesseract-ocr/tesseract) or [EasyOCR](https://github.com/JaidedAI/EasyOCR)
## 🚀 How It Works
1. **Upload Your Image**: Drag-and-drop or browse files (`.jpg`, `.png`, `.pdf`, etc.)
2. **Automatic Processing**: OCR analyzes and extracts text.
3. **Review & Edit**: Edit extracted content if needed.
4. **Export**: Copy or download the text.
## 📁 Project Structure
OCR-Text-Extractor/
│
├── static/
│ ├── style.css # Custom styles
│ └── scripts.js # OCR upload logic, UI interactions
│
├── index.html # Main HTML interface
├── README.md # You're reading it
## 🖼️ Screenshots

## 👨💻 Author
**Lamouchi Med Bayrem**
🔗 [GitHub](https://github.com/Lamouchi-Bayrem)
🔗 [LinkedIn](https://www.linkedin.com/in/lamouchi-med-bayrem/)
## 📄 License
This project is open-source and available under the MIT License.