Projects in Awesome Lists tagged with document-indexing
A curated list of projects in awesome lists tagged with document-indexing .
https://github.com/kyr0/clientside-search
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
bk-tree bm25 browser client-side damerau-levenshtein-distance document-indexing document-search full-text-search fuzzy-matching lucene multilingual nodejs phonetics search-engine state-hydration text-processing text-search tf-idf trie
Last synced: 04 May 2025
https://github.com/lethalbit/bookwurm
dead simple document index and search, nothing fancy
document-indexing document-search
Last synced: 07 Apr 2025
https://github.com/subhangisati/rag-using-deepseek-r1
This repository highlights my learning journey in building Retrieval-Augmented Generation (RAG) pipelines using DeepSeek on Lightning AI, covering document ingestion, retrieval, and integration with generative AI. It showcases fine-tuning, evaluation, and optimization for accurate open-domain QA and knowledge management.
api deepseek document-indexing embedding-models fine-tuning generative-ai gpt huggingface-transformers langchain llm rag
Last synced: 20 Mar 2025
https://github.com/maximlevchenko/boolean-model-implementations-comparison
The purpose of this project is also to compare the efficiency and performance of two different methods for handling search operations: the inverted index and the term-document matrix
boolean-model document-indexing flask full-stack inverted-index nltk-python python react search-engine term-document-matrix web-application
Last synced: 15 May 2025
https://github.com/trkotovicz/document-indexing-algorithm-py
Programa que simula um algoritmo de indexação de documentos similar ao do Google. Ele é capaz de identificar ocorrências de termos em arquivos TXT.
dequeue document-indexing doubly-linked-list-python estrutura-de-dados fifo lifo linked-lists python queue stack tads
Last synced: 25 Mar 2025
https://github.com/krisluczka/osse
Open Source Search Engine with built-in web/document crawler and an indexing method.
cpp document-indexing document-search document-searching indexing-engine search-engine web-crawler web-crawling web-indexer web-indexing
Last synced: 15 Apr 2025