An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with document-indexing

A curated list of projects in awesome lists tagged with document-indexing .

https://github.com/kyr0/clientside-search

A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.

bk-tree bm25 browser client-side damerau-levenshtein-distance document-indexing document-search full-text-search fuzzy-matching lucene multilingual nodejs phonetics search-engine state-hydration text-processing text-search tf-idf trie

Last synced: 04 May 2025

https://github.com/lethalbit/bookwurm

dead simple document index and search, nothing fancy

document-indexing document-search

Last synced: 07 Apr 2025

https://github.com/subhangisati/rag-using-deepseek-r1

This repository highlights my learning journey in building Retrieval-Augmented Generation (RAG) pipelines using DeepSeek on Lightning AI, covering document ingestion, retrieval, and integration with generative AI. It showcases fine-tuning, evaluation, and optimization for accurate open-domain QA and knowledge management.

api deepseek document-indexing embedding-models fine-tuning generative-ai gpt huggingface-transformers langchain llm rag

Last synced: 20 Mar 2025

https://github.com/maximlevchenko/boolean-model-implementations-comparison

The purpose of this project is also to compare the efficiency and performance of two different methods for handling search operations: the inverted index and the term-document matrix

boolean-model document-indexing flask full-stack inverted-index nltk-python python react search-engine term-document-matrix web-application

Last synced: 15 May 2025

https://github.com/trkotovicz/document-indexing-algorithm-py

Programa que simula um algoritmo de indexação de documentos similar ao do Google. Ele é capaz de identificar ocorrências de termos em arquivos TXT.

dequeue document-indexing doubly-linked-list-python estrutura-de-dados fifo lifo linked-lists python queue stack tads

Last synced: 25 Mar 2025

https://github.com/krisluczka/osse

Open Source Search Engine with built-in web/document crawler and an indexing method.

cpp document-indexing document-search document-searching indexing-engine search-engine web-crawler web-crawling web-indexer web-indexing

Last synced: 15 Apr 2025