https://github.com/lamouchi-bayrem/document_scanner

flask web app that scans documents using OpenCV
https://github.com/lamouchi-bayrem/document_scanner

ajax document extract-data flask ia ocr-recognition scanning-tool sql-server tailwindcss

Last synced: 9 months ago
JSON representation

flask web app that scans documents using OpenCV

Host: GitHub
URL: https://github.com/lamouchi-bayrem/document_scanner
Owner: Lamouchi-Bayrem
Created: 2025-05-06T16:08:12.000Z (9 months ago)
Default Branch: main
Last Pushed: 2025-05-06T19:04:30.000Z (9 months ago)
Last Synced: 2025-05-12T01:55:08.923Z (9 months ago)
Topics: ajax, document, extract-data, flask, ia, ocr-recognition, scanning-tool, sql-server, tailwindcss
Language: Python
Homepage:
Size: 4.09 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# OCR Text Extractor 🔍🖼️

A sleek, responsive web application that allows users to extract text from images and PDF files instantly using Optical Character Recognition (OCR). Built with Bootstrap, Animate.css, and a clean UI, this project offers fast, secure, and accurate OCR capabilities for a seamless user experience.

## 🌐 Live Demo

Coming soon...

## 📸 Features

- 📤 **Drag & Drop Upload**: Easily upload images or PDFs.
- ⚡ **Fast Processing**: OCR runs instantly upon upload.
- ✅ **High Accuracy**: Extracts text reliably from various formats.
- 🔐 **Privacy First**: Files are processed securely, without permanent storage.
- 📝 **Text Editing**: Review and edit extracted content.
- 📋 **Copy or Download**: Export your text or copy it to clipboard.

## 🛠️ Technologies Used

- **Frontend**:
- HTML5 / CSS3
- [Bootstrap 5](https://getbootstrap.com/)
- [Bootstrap Icons](https://icons.getbootstrap.com/)
- [Animate.css](https://animate.style/)
- [Google Fonts (Poppins)](https://fonts.google.com/specimen/Poppins)

- **Backend (Not included here)**:
- Can be integrated with Flask, FastAPI, Node.js, etc.
- OCR with [Tesseract OCR](https://github.com/tesseract-ocr/tesseract) or [EasyOCR](https://github.com/JaidedAI/EasyOCR)

## 🚀 How It Works

1. **Upload Your Image**: Drag-and-drop or browse files (`.jpg`, `.png`, `.pdf`, etc.)
2. **Automatic Processing**: OCR analyzes and extracts text.
3. **Review & Edit**: Edit extracted content if needed.
4. **Export**: Copy or download the text.

## 📁 Project Structure

OCR-Text-Extractor/
│
├── static/
│ ├── style.css # Custom styles
│ └── scripts.js # OCR upload logic, UI interactions
│
├── index.html # Main HTML interface
├── README.md # You're reading it

## 🖼️ Screenshots

![App Demo](https://github.com/Lamouchi-Bayrem/Document_Scanner/blob/main/uploads/Capture%20d'%C3%A9cran%202025-05-06%20172303.png)

## 👨‍💻 Author

**Lamouchi Med Bayrem**
🔗 [GitHub](https://github.com/Lamouchi-Bayrem)
🔗 [LinkedIn](https://www.linkedin.com/in/lamouchi-med-bayrem/)

## 📄 License

This project is open-source and available under the MIT License.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/lamouchi-bayrem/document_scanner

Awesome Lists containing this project

README