https://github.com/codename-detective/inverted-index-daat-python-flask
This repository hosts a Python project that implements an inverted index, with postings stored in linked lists, and Boolean retrieval over that index. It uses Flask to provide a web application through which users run Boolean queries with a Document-at-a-time (DAAT) strategy. The project aims for fast retrieval and efficient storage.
information-retrieval linked-list
- Host: GitHub
- URL: https://github.com/codename-detective/inverted-index-daat-python-flask
- Owner: CodeName-Detective
- Created: 2023-11-25T02:20:01.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-11-25T03:10:03.000Z (about 2 years ago)
- Last Synced: 2025-01-15T20:37:44.746Z (11 months ago)
- Topics: information-retrieval, linked-list
- Language: Python
- Homepage:
- Size: 251 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Inverted-Index-DAAT-Python-Flask
This repository contains the implementation of an Inverted Index using Linked Lists and Flask in Python. It allows for efficient Boolean queries using a Document-at-a-time (DAAT) strategy, ideal for understanding Information Retrieval concepts.
* Developed an Inverted Index using Python and Flask, integrating Linked Lists for efficient data management.
* Implemented a Document-at-a-Time (DAAT) Boolean query processing system for advanced information retrieval.
* Crafted a text preprocessing module for document tokenization and normalization.
* Enabled interactive querying through a Flask web application, demonstrating web development and information retrieval skills.
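To make the DAAT idea concrete, here is a minimal sketch of Boolean AND and OR over linked-list postings. It is not taken from the repository: the `Node`, `PostingsList`, `daat_and`, and `daat_or` names are hypothetical, and it assumes each postings list holds document IDs in ascending order.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class Node:
    """One posting: a document ID plus a pointer to the next posting."""
    doc_id: int
    next: Optional["Node"] = None


class PostingsList:
    """Singly linked list of doc IDs in ascending order (hypothetical class)."""

    def __init__(self, doc_ids: Iterable[int] = ()) -> None:
        self.head: Optional[Node] = None
        self.tail: Optional[Node] = None
        for doc_id in sorted(set(doc_ids)):
            self.append(doc_id)

    def append(self, doc_id: int) -> None:
        """Append a doc ID; callers are expected to append in ascending order."""
        node = Node(doc_id)
        if self.tail is None:
            self.head = self.tail = node
        else:
            self.tail.next = node
            self.tail = node

    def to_list(self) -> list[int]:
        """Walk the list and collect doc IDs into a plain Python list."""
        out = []
        node = self.head
        while node is not None:
            out.append(node.doc_id)
            node = node.next
        return out


def daat_and(lists: list[PostingsList]) -> PostingsList:
    """Intersect postings lists, keeping one pointer per term."""
    result = PostingsList()
    pointers = [pl.head for pl in lists]
    if not pointers:
        return result
    # Stop as soon as any list is exhausted: no further doc can match all terms.
    while all(p is not None for p in pointers):
        largest = max(p.doc_id for p in pointers)
        if all(p.doc_id == largest for p in pointers):
            result.append(largest)
            pointers = [p.next for p in pointers]
        else:
            # Advance every pointer that lags behind the current largest doc ID.
            pointers = [p.next if p.doc_id < largest else p for p in pointers]
    return result


def daat_or(lists: list[PostingsList]) -> PostingsList:
    """Union postings lists by repeatedly emitting the smallest current doc ID."""
    result = PostingsList()
    pointers = [pl.head for pl in lists if pl.head is not None]
    while pointers:
        smallest = min(p.doc_id for p in pointers)
        result.append(smallest)
        # Move past the emitted doc ID and drop lists that are exhausted.
        pointers = [p.next if p.doc_id == smallest else p for p in pointers]
        pointers = [p for p in pointers if p is not None]
    return result
```

For example, `daat_and([PostingsList([1, 3, 5, 7]), PostingsList([3, 4, 5, 8])]).to_list()` walks both lists once and yields the doc IDs common to both. Each document is fully resolved across all query terms before the next one is considered, which is what distinguishes DAAT from term-at-a-time evaluation.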
## Files Description
* **src/indexer.py:** Contains the core logic for building the inverted index.
* **src/linkedlist.py:** Implements the linked list data structure used in the index.
* **src/preprocessor.py:** Handles the preprocessing of documents, including tokenization and normalization.
* **src/run_project.py:** The main script to run the project. It sets up the Flask application and endpoints for querying.
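
As a rough illustration of how preprocessing and indexing could fit together, the sketch below tokenizes and normalizes documents, then maps each term to a sorted list of document IDs. It does not reproduce the repository's modules: `preprocess`, `build_index`, and the tiny `STOPWORDS` set are assumptions made for this example, and plain sorted lists stand in for the linked-list postings.

```python
from __future__ import annotations

import re
from collections import defaultdict

# Illustrative stopword list only; the project's real list is not shown in this README.
STOPWORDS = frozenset({"a", "an", "and", "in", "is", "of", "the", "to"})


def preprocess(text: str) -> list[str]:
    """Lowercase, replace non-alphanumeric runs with spaces, split, drop stopwords."""
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return [tok for tok in cleaned.split() if tok not in STOPWORDS]


def build_index(docs: dict[int, str]) -> dict[str, list[int]]:
    """Map each term to the ascending list of doc IDs that contain it."""
    postings: dict[str, set[int]] = defaultdict(set)
    for doc_id, text in docs.items():
        for term in preprocess(text):
            postings[term].add(doc_id)
    # Sorted postings are what a DAAT merge relies on.
    return {term: sorted(ids) for term, ids in postings.items()}


if __name__ == "__main__":
    index = build_index({1: "The quick fox", 2: "Fox in the snow"})
    print(index["fox"])
```

Keeping postings sorted at build time means query processing never has to sort; it only walks the lists forward.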