Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/arosh/BM25Transformer

(Python) transform a document-term matrix to an Okapi/BM25 representation
https://github.com/arosh/BM25Transformer

information-retrieval machine-learning natural-language-processing python scikit-learn

Last synced: about 2 months ago
JSON representation

(Python) transform a document-term matrix to an Okapi/BM25 representation

Awesome Lists containing this project

README

        

# BM25Transformer

## About

This library transforms a [document-term matrix](https://en.wikipedia.org/wiki/Document-term_matrix) to a [Okapi/BM25](https://en.wikipedia.org/wiki/Okapi_BM25) representation.

API of this library inherits from [sklearn.feature\_extraction.text.TfidfTransformer](http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfTransformer.html).

## References

* [Okapi BM25: a non-binary model - Introduction to Information Retrieval](http://nlp.stanford.edu/IR-book/html/htmledition/okapi-bm25-a-non-binary-model-1.html)

## License

BSD 3 clause