Projects in Awesome Lists tagged with document-image-processing
A curated list of projects in awesome lists tagged with document-image-processing .
https://github.com/unstructured-io/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 18 Apr 2025
https://github.com/Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Last synced: 26 Mar 2025
https://github.com/layout-parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 23 Apr 2025
https://github.com/Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Last synced: 15 Mar 2025
https://github.com/caltechlibrary/documentarist
Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
annotation annotator document-classification document-image-classification document-image-processing handwriting-recognition handwritten-character-recognition handwritten-mathematical-symbols handwritten-text-recognition htr image-classification image-recognition image-tagging machine-learning math-recognition tagging
Last synced: 14 Apr 2025
https://github.com/tony-xlh/quality-evaluation-of-scanned-document-images
A web app evaluating the quality the scanned document images
document-image-processing image-quality-assessment
Last synced: 24 Feb 2025
https://github.com/sfikas/sophia-trikoupi-handwritten-dataset
Sophia Trikoupi dataset (Collection of 46 handwritten, annotated pages)
dataset document-image-processing greek-language
Last synced: 04 Mar 2025