Projects in Awesome Lists tagged with docling
A curated list of projects in awesome lists tagged with docling.
https://github.com/ggozad/haiku.rag
Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling
ai docling lancedb mcp mcp-server ml pydantic-ai rag
Last synced: 20 May 2026
https://github.com/shoryasethia/markdrop
A Python package for converting PDFs to Markdown while extracting images and tables and generating text descriptions for the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
agents docling image-to-text llm markdrop marker markitdown open-source pdf-to-markdown pdf-to-text pypi-package table-to-text
Last synced: 14 Jan 2026
https://github.com/stevereiner/flexible-graphrag
Flexible GraphRAG: Python, LlamaIndex, Docker Compose: 8 Graph dbs, 10 Vector dbs, OpenSearch, Elasticsearch, Alfresco. 13 data sources (9 auto-sync), KG auto-building, schemas, LLMs, Docling or LlamaParse doc processing, GraphRAG, RAG only, Hybrid search, AI chat. React, Vue, Angular frontends, FastAPI backend, REST API, MCP Server. Please 🌟 Star
ai ai-chat alfresco arcadedb doc-processing docling falkordb generative-ai graphrag hybrid-search knowledge-graph llamaindex llamaparse llm mcp mcp-server neo4j python rag search
Last synced: 16 May 2026
https://github.com/fahdmirza/doclingwithollama
Docling with Ollama - RAG on Local Files with Local Models
docling ollama pdf-converter retrieval-augmented-generation
Last synced: 25 Oct 2025
https://github.com/ibm/docling-graph
Transform unstructured documents into validated, rich and queryable knowledge graphs.
ai convert docling document-processing knowledge-graph
Last synced: 26 Jan 2026
https://github.com/arconia-io/arconia-migrations
Tools and OpenRewrite recipes to automate refactoring, migrations, and upgrades for Java projects.
arconia docling java junit migrations open-rewrite spring-boot testcontainers
Last synced: 01 Apr 2026
https://github.com/versionhq/multi-agent-system
Autonomous agent networks for task automation that requires multi-step reasoning
agentic-ai autonomous-agents composiotool docling graph-theory langchain litellm matplotlib mem0ai multi-agent-systems multi-step-reasoning networkx orchestration-framework pydantic pygraphviz python3 rag self-directed-learning
Last synced: 08 Oct 2025
https://github.com/gyunggyung/docling-translate
Advanced PDF/Document Translator with interactive comparison. Built on IBM Docling.
ai docling docs document-parser docx html huggingface lfm2 llm nlp pdf pdf-converter pdf-to-text pptx python qwen3 streamlit tables translation yanolja
Last synced: 07 Mar 2026
https://github.com/ya0002/obsidian-assist
Make Zettelkasten-style note-taking the foundation of interactions with Large Language Models (LLMs).
docling graph-based knowledge-graph large-language-models llm llm-privacy obsidian obsidian-assist obsidian-md ollama second-brain zettlekasten
Last synced: 20 Aug 2025
https://github.com/btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.
docling docling-sdk docling-ts ocr
Last synced: 15 May 2026
https://github.com/parthapray/docling_rag_langchain_colab
This repo contains code for RAG using Docling in a Colab notebook with LangChain, Milvus, a Hugging Face embedding model, and an LLM.
all-minilm-l6-v2 chunking colab-notebook docling huggingface langchain large-language-models milvus pdf retrieval-augmented-generation sentence-transformers
Last synced: 18 May 2026
https://github.com/jarus77/markdrop
A Python package for converting PDFs to Markdown while extracting images and tables and generating text descriptions for the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
agents docling image-to-text llm markdown markdrop marker opensource pdf-to-markdown pdf-to-te pypi-package table-to-text
Last synced: 13 Apr 2026
https://github.com/parthapray/docling_colab
This repo contains a Google Colab notebook for handling Docling for data extraction such as text, images, tables, etc.
chunk chunking colab-notebook docling docx embed extraction-data image lancedb markdown pdf pptx retrieval-augmented-generation table text transformers
Last synced: 16 May 2026
https://github.com/quarkiverse/quarkus-docling
Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem
ai docling document-processing embedding quarkus quarkus-extension rag
Last synced: 01 Jan 2026
https://github.com/maciekmalachowski/cvwizard
🧙‍♂️AI-powered tool to optimize your CV with job-specific keywords and align it to your dream job.
ai-powered beautifulsoup docling flask javascript llama-index openai python rag react-vite resume tailwindcss typescript
Last synced: 11 Apr 2026
https://github.com/rishang/deep-research
Python SDK for Deep-Research
ai deep-research deepresearch docling litellm
Last synced: 12 Jun 2025
https://github.com/padrio/doclens
Make image-heavy PDFs grep-able for AI agents. Convert PDF corpora into structured Markdown where every diagram, screenshot and table is searchable as text. No vector DB, no embeddings - just clean Markdown with LLM-generated image descriptions.
ai-agents anthropic claude docling document-processing knowledge-base llm markdown ocr pdf rag-alternative
Last synced: 26 Apr 2026
https://github.com/kwame-mintah/python-langchain-chainlit-qdrant-ollama-stack-template
📄 A project template for creating a Chainlit application, using a locally run model via Ollama and the Qdrant vector database for document retrieval.
chainlit deepseek-r1 docker docker-compose docling huggingface langchain ollama qdrant
Last synced: 09 Apr 2026
https://github.com/kavyakapoor420/haqdarshak-stackoverflow-project
This project aims to build an AI-powered knowledge-sharing system that enables agents to ask and answer questions, contribute verified learnings, and organically grow a community-driven knowledge base, much like a “StackOverflow for Scheme Agents.” The system will also promote community engagement, ultimately improving retention and performance.
docling elasticsearch frontend mongodb rag reactjs vectordb
Last synced: 04 Apr 2026
https://github.com/shijincai/fast360
The industry's first "Open Source OCR Arena," a free, no-login utility for one-click benchmarking of 7 top-tier models (Marker, MinerU, MonkeyOCR, Docling, Dolphin, OCRFlux, PP-StructureV3) on your PDF/image files, specializing in PDF-to-Markdown conversion.
benchmark computer-vision data-extraction docling document-analysis document-parser evaluation latex latex-document machine-learning markdown-converter marker monkeyocr ocr ocr-service paddleocr pdf-converter pdf-to-markdown rag
Last synced: 18 May 2026
https://github.com/miku/doclingclient
A Go Docling client library and CLI
chunking docling document-conversion markdown pdf
Last synced: 26 May 2026
https://github.com/parthapray/gradio_docling_rag_langchain
This repo provides RAG using Docling, LangChain, Milvus, Sentence Transformers, and Hugging Face LLMs.
docling html image large-language-models milvus pdf pptx retrieval-augmented-generation sentence-transformers
Last synced: 27 Feb 2025
https://github.com/patw/docinator
A small service to convert PDF files to Markdown using the Docling library
Last synced: 26 Apr 2026
https://github.com/katagaki/lingus
PDF and Markdown conversion using Docling and LibreOffice
Last synced: 16 Oct 2025
https://github.com/k0msenapati/waterwise
💧 Smart Water Chatbot
chainlit chroma docling langchain openrouter python rag uv water
Last synced: 16 Apr 2026
https://github.com/hyoaru/rag4jiya-process
Agentic RAG-based system with nursing handbooks and transes as knowledge base for my bebiloves
ai-agent chromadb docling pydantic pydantic-ai pydantic-graph retrieval-augmented-generation
Last synced: 16 Apr 2026
https://github.com/docling-project/docling4j
Docling4j brings the functionalities of Docling in document understanding to Java® projects
ai docling document-parser document-parsing document-understanding documents java pdf pdf-converter pdf-to-json
Last synced: 15 Jun 2025