https://github.com/jcaperella29/document_cleaning_cli
🧠AI-powered pipeline for cleaning scanned documents. Removes noise, enhances text, auto-tunes model weights, and returns OCR-optimized PDFs via CLI or cloud API.
https://github.com/jcaperella29/document_cleaning_cli
auto-tune batch-processing cli-tool cloud-run computer-vision deep-learning denoising document-ai document-processing fastapi image-enhancement image-processing ocr ocr-pipeline ocr-pipelines pytesseract python rest-api scanned-documents
Last synced: 9 days ago
JSON representation
🧠AI-powered pipeline for cleaning scanned documents. Removes noise, enhances text, auto-tunes model weights, and returns OCR-optimized PDFs via CLI or cloud API.
- Host: GitHub
- URL: https://github.com/jcaperella29/document_cleaning_cli
- Owner: jcaperella29
- Created: 2025-02-15T19:43:33.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2026-05-26T23:30:27.000Z (21 days ago)
- Last Synced: 2026-05-27T01:20:15.103Z (21 days ago)
- Topics: auto-tune, batch-processing, cli-tool, cloud-run, computer-vision, deep-learning, denoising, document-ai, document-processing, fastapi, image-enhancement, image-processing, ocr, ocr-pipeline, ocr-pipelines, pytesseract, python, rest-api, scanned-documents
- Language: MATLAB
- Homepage: https://document-cleaning-cli-111-777-888-7777-934773375188.us-central1.run.app/
- Size: 94.7 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md