Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/quasarbright/ir-final-project
ir-final-project
https://github.com/quasarbright/ir-final-project
Last synced: 26 days ago
JSON representation
ir-final-project
- Host: GitHub
- URL: https://github.com/quasarbright/ir-final-project
- Owner: quasarbright
- Created: 2022-12-13T23:28:49.000Z (about 2 years ago)
- Default Branch: master
- Last Pushed: 2022-12-14T04:34:03.000Z (about 2 years ago)
- Last Synced: 2024-11-06T07:42:17.612Z (3 months ago)
- Language: Jupyter Notebook
- Size: 54.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Mike Delmonaco ir-final-project
This repository contains the source code for my final project for a course on Information Retrieval.
For my project, I implemented the document processing component of a full-text ad-hoc search engine for racket
documentation.## Dependencies
* Python version 3.7.6 or greater
* Python packages listed in `requirements.txt`
* nltk punkt models. To install: https://www.nltk.org/data.html## How to use it
Once dependencies are installed, run `main.py` to build the index. It saves it to `out/index.pickle`.
To use a saved index for a search engine, load it from file with `load_index`.