https://github.com/raphsenn/info-retrieval-notebooks

Some information retrieval algorithms and data structures (inverted index, ranking (BM25, tf, idf scores), fuzzy search, ...)
data-science fuzzy-search information-retrieval information-systems inverted-index searching-algorithms


# info-retrieval-notebooks

* Designed for viewing on GitHub.

## Implemented Algorithms and Data Structures

### search
* InvertedIndex

* InvertedIndex (via vector space model, linear algebra, sparse matrices)

* Similarity search (via cosine similarity)

* Fuzzy string search

* Ranking and evaluation
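
To give a feel for the first item in the list above, here is a minimal sketch of an inverted index with AND queries. It is not taken from the notebooks: the function names (`tokenize`, `build_inverted_index`, `intersect`, `query`) and the toy documents are assumptions made for illustration only.

```python
from collections import defaultdict
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(docs):
    """Map each term to the sorted list of ids of documents containing it."""
    index = defaultdict(list)
    for doc_id, text in enumerate(docs):
        # Use a set so each document appears at most once per term.
        for term in set(tokenize(text)):
            index[term].append(doc_id)  # doc ids arrive in increasing order
    return dict(index)


def intersect(a, b):
    """Intersect two sorted posting lists with a linear merge."""
    i = j = 0
    result = []
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            result.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return result


def query(index, text):
    """Return ids of documents containing every query term (AND semantics)."""
    terms = tokenize(text)
    if not terms:
        return []
    result = index.get(terms[0], [])
    for term in terms[1:]:
        result = intersect(result, index.get(term, []))
    return result


# Hypothetical toy collection, not the IMDB dataset.
docs = ["The Matrix", "The Matrix Reloaded", "Blade Runner"]
index = build_inverted_index(docs)
print(query(index, "matrix reloaded"))  # [1]
```

Because every posting list is kept sorted, the merge in `intersect` runs in time linear in the combined length of the two lists. Ranking (for example BM25) would be layered on top by storing term frequencies alongside the document ids.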

### databases
* Basic database operations (project, select, cartesian product)

* More database operations (equi join, merge join, hash join, group by)

* SPARQL to SQL algorithm

* SQL to SPARQL algorithm
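
As an illustration of the join operations listed above, the following is a minimal sketch of a hash join. It assumes relations are plain lists of tuples joined on column indices; the name `hash_join` and the sample rows are hypothetical and do not come from the notebooks.

```python
from collections import defaultdict


def hash_join(left, right, left_col, right_col):
    """Equi-join two relations (lists of tuples) on the given column indices."""
    # Build phase: hash the rows of the left relation by their join key.
    buckets = defaultdict(list)
    for row in left:
        buckets[row[left_col]].append(row)

    # Probe phase: look up each right row's key and emit concatenated rows.
    result = []
    for row in right:
        for match in buckets.get(row[right_col], []):
            result.append(match + row)
    return result


# Hypothetical sample relations.
movies = [(1, "Alien"), (2, "Heat")]
ratings = [(1, 8.5), (1, 9.0), (3, 7.0)]
print(hash_join(movies, ratings, 0, 0))
# [(1, 'Alien', 1, 8.5), (1, 'Alien', 1, 9.0)]
```

Building the hash table on the smaller relation keeps memory use down. Unlike a merge join, neither input needs to be sorted.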

### Used datasets

#### IMDB movies dataset
[https://www.kaggle.com/datasets/ashpalsingh1525/imdb-movies-dataset](https://www.kaggle.com/datasets/ashpalsingh1525/imdb-movies-dataset)