An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with document-parsing

A curated list of projects in awesome lists tagged with document-parsing .

https://github.com/enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

ai document-image-analysis document-intelligence document-parsing document-processing langchain llm machine-learning nlp ocr openai pdf pdf-to-text python

Last synced: 04 Apr 2025

https://github.com/enoch3712/extractthinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

ai document-image-analysis document-intelligence document-parsing document-processing langchain llm machine-learning nlp ocr openai pdf pdf-to-text python

Last synced: 14 May 2025

https://github.com/harishdeivanayagam/rowfill

Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers

document document-extraction document-parsing image-ocr langgraph llama llm nextjs ocr ocr-javascript ollama openai pdf pdfs unstructured unstructured-data vision vision-api

Last synced: 13 Apr 2025

https://github.com/j-sephb-lt-n/pdf-bank-statement-parser

Tool for converting First National Bank (FNB) bank statement PDFs into useful structured data

bank banking document-parsing financial-analysis first-national-bank fnb pdf-parser pdf-parsing python

Last synced: 31 Aug 2025

https://github.com/ziming/laravel-docparser

Docparser OCR Package for PHP Laravel

doc-parser docparser document-parsing laravel ocr php

Last synced: 04 May 2025

https://github.com/cr4yfish/docling-js

Parsing Documents to one datatype (Typescript port of Docling)

document-parser document-parsing genai pdf-converter pdf-to-text

Last synced: 31 Aug 2025

https://github.com/setiaafandi/anyparser_crewai

Supercharge your AI workflows by combining Anyparser’s advanced content extraction with Crew AI. With this integration, you can effortlessly leverage Anyparser’s document processing and data extraction tools within your Crew AI applications.

anyparser cache-augmented-generation cag crew-ai crew-ai-rag crewai-rag document-parser document-parsing kag knowledge-graph python rag retrieval-augmented-generation typescript

Last synced: 07 Mar 2025

https://github.com/docling-project/docling4j

Docling4j brings the functionalities of Docling in document understanding to Java® projects

ai docling document-parser document-parsing document-understanding documents java pdf pdf-converter pdf-to-json

Last synced: 15 Jun 2025

https://github.com/anyparser/anyparser_crewai

Supercharge your AI workflows by combining Anyparser’s advanced content extraction with Crew AI. With this integration, you can effortlessly leverage Anyparser’s document processing and data extraction tools within your Crew AI applications.

anyparser artificial-intelligence cache-augmented-generation cag crew-ai crew-ai-rag crewai crewai-rag document-parser document-parsing kag knowledge-graph python rag retrieval-augmented-generation typescript

Last synced: 04 Oct 2025

https://github.com/imnotamr/english-to-french-app-using-streamlit-

An interactive Streamlit app that translates English text and documents to French, featuring Google Translate API integration and text-to-speech functionality. Includes PDF and Word document translation.

ai-projects cloud-deployment deep-learning document-parsing machine-translation nlp openai-projects python-tools speech-synthesis streamlit streamlit-application streamlit-cloud streamlit-webapp text-analysis text-to-speech translation voice-output

Last synced: 02 Apr 2025