Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sambhav/ir-system

An information retrieval system for a comparative analysis of TF-IDF and BM25 ranking mechanisms
https://github.com/sambhav/ir-system

bm25 comparative-analysis information-retrieval reddit scraper tf-idf whoosh

Last synced: 7 days ago
JSON representation

An information retrieval system for a comparative analysis of TF-IDF and BM25 ranking mechanisms

Awesome Lists containing this project

README

        

# IR-system
An information retrieval system for a comparative analysis of TF-IDF and BM25 ranking mechanisms

## Setting up the repo
* Clone the repo
* Create a new virtual environment using

virtualenv -p python3 venv

* Activate the virtual environement via

source venv/bin/activate

* Install the repo requirements via

python setup.py install

* To scrape documents use

irs scrape

* To create an index use

irs create_index

* To index dumped data use

irs index_documents $JSON_PATH

* To show results use

irs run