An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with document-chunking

A curated list of projects in awesome lists tagged with document-chunking.

https://github.com/messkan/rag-chunk

A Python CLI to test, benchmark, and find the best RAG chunking strategy for your Markdown documents.

chunking document-chunking embedding-vectors ia langchain llm nlp python rag rag-pipeline retrieval-augmented-generation text-splitting vector-search

Last synced: 05 Mar 2026

https://github.com/speedyk-005/chunklet-py

One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs, RAG pipelines, and beyond.

ai chunking chunks-algorithm chunks-processing code-chunking code-structure document-chunking natural-language-processing nlp rag text-splitting visualization

Last synced: 02 Mar 2026

https://github.com/davidmoserai/azuredocumentintelligencechunker

A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.

agent agents azure azure-ai-document-intelligence azure-ai-search chunking document-chunking langchain layout-parser layout-parsing llm production-grade python rag react react-pdf-viewer retrieval-augmented-generation unstructured-data

Last synced: 08 Feb 2026