https://github.com/arosh/BM25Transformer
(Python) transform a document-term matrix to an Okapi/BM25 representation
- Host: GitHub
- URL: https://github.com/arosh/BM25Transformer
- Owner: arosh
- License: bsd-3-clause
- Created: 2016-01-20T13:25:15.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2018-04-17T13:52:15.000Z (almost 7 years ago)
- Last Synced: 2024-08-06T03:03:21.200Z (6 months ago)
- Topics: information-retrieval, machine-learning, natural-language-processing, python, scikit-learn
- Language: Python
- Homepage: http://kujira16.hateblo.jp/entry/2016/01/20/235500
- Size: 2.93 KB
- Stars: 54
- Watchers: 4
- Forks: 6
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# BM25Transformer
## About
This library transforms a [document-term matrix](https://en.wikipedia.org/wiki/Document-term_matrix) into an [Okapi/BM25](https://en.wikipedia.org/wiki/Okapi_BM25) representation.
The API of this library follows that of [sklearn.feature\_extraction.text.TfidfTransformer](http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfTransformer.html).
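
To make the transformation concrete, here is a minimal sketch of BM25 term weighting applied to a small dense count matrix, following the formula in the reference listed below. It is not this library's code: the function name `bm25_weights`, the defaults `k1=1.2` and `b=0.75`, and the plain `log(N / df)` form of IDF are assumptions chosen for illustration, and the library's own parameters and IDF variant may differ.

```python
import numpy as np


def bm25_weights(X, k1=1.2, b=0.75):
    """Illustrative BM25 weighting of a dense document-term count matrix.

    Hypothetical helper, not part of BM25Transformer. Assumes rows are
    documents, columns are terms, the corpus is non-empty, and b < 1.
    """
    X = np.asarray(X, dtype=float)
    n_docs = X.shape[0]

    # Document frequency: number of documents that contain each term.
    df = np.count_nonzero(X, axis=0)
    # Plain IDF, log(N / df); unseen terms are clamped to df = 1.
    idf = np.log(n_docs / np.maximum(df, 1))

    # Document lengths (total term count per row) and their average.
    doc_len = X.sum(axis=1, keepdims=True)
    avg_len = doc_len.mean()

    # Length-normalised saturation term from the BM25 formula.
    norm = k1 * ((1.0 - b) + b * doc_len / avg_len)
    return idf * (k1 + 1.0) * X / (norm + X)


# Three documents, three terms (raw counts).
X = np.array([[2, 0, 1],
              [0, 1, 1],
              [1, 0, 0]])
W = bm25_weights(X)
print(W)
```

In practice the count matrix would typically come from something like scikit-learn's `CountVectorizer`, and would be sparse; the dense NumPy version above is only meant to show the arithmetic.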
## References
* [Okapi BM25: a non-binary model - Introduction to Information Retrieval](http://nlp.stanford.edu/IR-book/html/htmledition/okapi-bm25-a-non-binary-model-1.html)
## License
BSD 3-Clause