An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with document-processing-pipeline

A curated list of projects in awesome lists tagged with document-processing-pipeline.

https://github.com/shalinianandaphd/prism

pRISM is a repository that combines Retrieval-Augmented Generation (RAG) with a multi-LLM voting approach to create accurate and reliable AI-generated outputs. It integrates multiple language models, including Mistral, Claude 3.5, and OpenAI models, to enhance performance through advanced consensus techniques.

ai-transparency document-processing-pipeline fine-tuning legal-ai llm lora multi-model phi-4 semantic-ai weighted-majority

Last synced: 17 Jun 2026

https://github.com/akshar-raaj/document-processing

A fast, flexible API for extracting text from PDFs and images using smart file detection and OCR—perfect for automating your document workflows.

ai artificial-intelligence document-processing-pipeline ocr optical-character-recognition tesseract textract

Last synced: 30 Jun 2025

https://github.com/jasoncobra3/floorplan-dimractor

A sophisticated Python pipeline for automatically extracting dimensions and cabinet codes from architectural floorplan PDFs. This tool converts various dimension formats into standardized measurements and provides structured output with visualization capabilities.

architecture-tools automation-tools blueprint-analysis cad-automation computer-vision dimension-extraction document-processing document-processing-pipeline floorplan-analysis image-processing measurement-tools opencv pdf-parser pdf-processing pdfplumber pymupdf streamlit text-detection

Last synced: 18 Apr 2026