Projects in Awesome Lists tagged with document-processing-pipeline
A curated list of projects in awesome lists tagged with document-processing-pipeline.
https://github.com/shalinianandaphd/prism
pRISM is a repository that combines Retrieval-Augmented Generation (RAG) with a multi-LLM voting approach to produce accurate and reliable AI-generated outputs. It integrates multiple language models, including Mistral, Claude 3.5, and OpenAI models, and uses consensus techniques across them to improve performance.
ai-transparency document-processing-pipeline fine-tuning legal-ai llm lora multi-model phi-4 semantic-ai weighted-majority
Last synced: 17 Jun 2026
https://github.com/akshar-raaj/document-processing
A fast, flexible API for extracting text from PDFs and images using smart file detection and OCR—perfect for automating your document workflows.
ai artificial-intelligence document-processing-pipeline ocr optical-character-recognition tesseract textract
Last synced: 30 Jun 2025
https://github.com/jasoncobra3/floorplan-dimractor
A sophisticated Python pipeline for automatically extracting dimensions and cabinet codes from architectural floorplan PDFs. This tool converts various dimension formats into standardized measurements and provides structured output with visualization capabilities.
architecture-tools automation-tools blueprint-analysis cad-automation computer-vision dimension-extraction document-processing document-processing-pipeline floorplan-analysis image-processing measurement-tools opencv pdf-parser pdf-processing pdfplumber pymupdf streamlit text-detection
Last synced: 18 Apr 2026