![Version](https://badge.fury.io/py/rexmex.svg?style=plastic)
[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![repo size](https://img.shields.io/github/repo-size/AstraZeneca/rexmex.svg)](https://github.com/AstraZeneca/rexmex/archive/master.zip)
[![build badge](https://github.com/AstraZeneca/rexmex/workflows/CI/badge.svg)](https://github.com/AstraZeneca/rexmex/actions?query=workflow%3ACI)
[![codecov](https://codecov.io/gh/AstraZeneca/rexmex/branch/main/graph/badge.svg?token=cYgAejRA0Z)](https://codecov.io/gh/AstraZeneca/rexmex)



--------------------------------------------------------------------------------

**reXmeX** is a recommender system evaluation metric library.

Please look at the **[Documentation](https://rexmex.readthedocs.io/en/latest/)** and **[External Resources](https://rexmex.readthedocs.io/en/latest/notes/resources.html)**.

**reXmeX** consists of utilities for recommender system evaluation. First, it provides a comprehensive collection of metrics for the evaluation of recommender systems. Second, it includes a variety of methods for reporting and plotting the performance results. The implemented metrics include both well-known measures and newly proposed ones from data mining ([ICDM](http://icdm2019.bigke.org/), [CIKM](http://www.cikm2019.net/), [KDD](https://www.kdd.org/kdd2020/)) conferences and prominent journals.

**Citing**

If you find *RexMex* useful in your research, please consider adding the following citation:

```bibtex
@inproceedings{rexmex,
title = {{rexmex: A General Purpose Recommender Metrics Library for Fair Evaluation.}},
author = {Benedek Rozemberczki and Sebastian Nilsson and Piotr Grabowski and Charles Tapley Hoyt and Gavin Edwards},
year = {2021},
}
```
--------------------------------------------------------------------------------

**An introductory example**

The following example loads a synthetic dataset which has the mandatory `y_true` and `y_score` keys. The dataset has binary labels and predicted probability scores. We read the dataset, define a default `ClassificationMetricSet` instance for evaluating the predictions, use this metric set to create a score card, and compute the predictive performance metrics.

```python
from rexmex import ClassificationMetricSet, DatasetReader, ScoreCard

# Read the bundled synthetic dataset with `y_true` and `y_score` columns.
reader = DatasetReader()
scores = reader.read_dataset()

# Collect the default classification metrics and wrap them in a score card.
metric_set = ClassificationMetricSet()
score_card = ScoreCard(metric_set)

# Compute the performance metrics for the labels and predicted scores.
report = score_card.get_performance_metrics(scores["y_true"], scores["y_score"])
```

--------------------------------------------------------------------------------

**An advanced example**

The following, more advanced example loads the same synthetic dataset, which has the `source_id`, `target_id`, `source_group` and `target_group` keys in addition to the mandatory `y_true` and `y_score`. Using the `source_group` key we group the predictions and return a performance metric report.

```python
from rexmex import ClassificationMetricSet, DatasetReader, ScoreCard

# Read the synthetic dataset, which also contains the grouping keys.
reader = DatasetReader()
scores = reader.read_dataset()

# Collect the default classification metrics and wrap them in a score card.
metric_set = ClassificationMetricSet()
score_card = ScoreCard(metric_set)

# Group the predictions by `source_group` and report the metrics per group.
report = score_card.generate_report(scores, grouping=["source_group"])
```

--------------------------------------------------------------------------------

**Scorecard**

A **rexmex** score card allows reporting recommender system performance metrics, plotting them, and saving the results. Our framework provides 7 rating, 38 classification, 18 ranking, and 2 coverage metrics.

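As a minimal sketch of saving a report, assuming that the report returned by `generate_report` behaves like a pandas `DataFrame` (an assumption for illustration, not something stated above):

```python
from rexmex import ClassificationMetricSet, DatasetReader, ScoreCard

# Build a score card as in the examples above.
scores = DatasetReader().read_dataset()
score_card = ScoreCard(ClassificationMetricSet())

# Generate a grouped report; assuming it behaves like a pandas DataFrame,
# standard pandas calls such as `to_csv` can be used to save it.
report = score_card.generate_report(scores, grouping=["source_group"])
report.to_csv("performance_report.csv")  # illustrative file name
```
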
**Metric Sets**

Metric sets allow users to calculate a range of evaluation metrics for a pair of ground-truth and predicted score vectors. We provide a general `MetricSet` class, and the specialized metric sets with pre-set metrics fall into the following general categories (a short usage sketch follows the list):

- **Ranking**
- **Rating**
- **Classification**
- **Coverage**

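Each specialized set can also be inspected on its own; a minimal sketch, assuming that a metric set behaves like a mapping from metric names to metric functions (this mapping behaviour is an assumption for illustration, not documented above):

```python
from rexmex import ClassificationMetricSet

metric_set = ClassificationMetricSet()

# Assuming dict-like behaviour, list the metrics the set would compute.
for name, metric in metric_set.items():
    print(name, metric)
```
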
--------------------------------------------------------------------------------

**Ranking Metric Set**

* **[Normalized Distance Based Performance Measure (NDPM)](https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/%28SICI%291097-4571%28199503%2946%3A2%3C133%3A%3AAID-ASI6%3E3.0.CO%3B2-Z)**
* **[Discounted Cumulative Gain (DCG)](https://en.wikipedia.org/wiki/Discounted_cumulative_gain)**
* **[Normalized Discounted Cumulative Gain (NDCG)](https://en.wikipedia.org/wiki/Discounted_cumulative_gain)**
* **[Reciprocal Rank](https://en.wikipedia.org/wiki/Mean_reciprocal_rank)**

The metric set also includes the following ranking metrics; a toy reciprocal rank computation follows the list.

* **[Mean Reciprocal Rank (MRR)](https://en.wikipedia.org/wiki/Mean_reciprocal_rank)**
* **[Spearman's Rho](https://en.wikipedia.org/wiki/Spearman%27s_rank_correlation_coefficient)**
* **[Kendall Tau](https://en.wikipedia.org/wiki/Kendall_rank_correlation_coefficient)**
* **[HITS@k](https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval))**
* **[Novelty](https://www.sciencedirect.com/science/article/pii/S163107051930043X)**
* **[Average Recall @ k](https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval))**
* **[Mean Average Recall @ k](https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval))**
* **[Average Precision @ k](https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval))**
* **[Mean Average Precision @ k](https://en.wikipedia.org/wiki/Evaluation_measures_(information_retrieval))**
* **[Personalisation](http://www.mavir.net/docs/tfm-vargas-sandoval.pdf)**
* **[Intra List Similarity](http://www.mavir.net/docs/tfm-vargas-sandoval.pdf)**

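As a quick illustration of one of the metrics listed above, the reciprocal rank is 1 divided by the position of the first relevant item in the ranked list. The following is a plain re-implementation for illustration, not the library's code:

```python
def reciprocal_rank(relevance):
    """Return 1 / position of the first relevant item, or 0.0 if none is relevant."""
    for position, relevant in enumerate(relevance, start=1):
        if relevant:
            return 1.0 / position
    return 0.0

# The first relevant item sits at position 3 -> reciprocal rank of 1/3.
print(reciprocal_rank([0, 0, 1, 0]))
```
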
--------------------------------------------------------------------------------

**Rating Metric Set**

These metrics assume that items are scored explicitly and that ratings are predicted by a regression model; a toy example of one of these metrics follows the list below.

* **[Mean Squared Error (MSE)](https://en.wikipedia.org/wiki/Mean_squared_error)**
* **[Root Mean Squared Error (RMSE)](https://en.wikipedia.org/wiki/Mean_squared_error)**
* **[Mean Absolute Error (MAE)](https://en.wikipedia.org/wiki/Mean_absolute_error)**
* **[Mean Absolute Percentage Error (MAPE)](https://en.wikipedia.org/wiki/Mean_absolute_percentage_error)**

The metric set also includes the following rating metrics:

* **[Symmetric Mean Absolute Percentage Error (SMAPE)](https://en.wikipedia.org/wiki/Symmetric_mean_absolute_percentage_error)**
* **[Pearson Correlation](https://en.wikipedia.org/wiki/Pearson_correlation_coefficient)**
* **[Coefficient of Determination](https://en.wikipedia.org/wiki/Coefficient_of_determination)**

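As a toy example (a plain re-implementation for illustration, not the library's code), the root mean squared error of predicted ratings is the square root of the average squared difference between true and predicted values:

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error between true and predicted ratings."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Errors of 1, 0 and 2 stars -> sqrt((1 + 0 + 4) / 3) ≈ 1.29.
print(rmse([4, 3, 5], [3, 3, 3]))
```
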
--------------------------------------------------------------------------------

**Classification Metric Set**

These metrics assume that the items are scored with raw probabilities, which can be binarized with a threshold; a small binarization sketch follows the list below.

* **[Precision (or Positive Predictive Value)](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Recall (Sensitivity, Hit Rate, or True Positive Rate)](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Area Under the Precision Recall Curve (AUPRC)](https://besjournals.onlinelibrary.wiley.com/doi/10.1111/2041-210X.13140)**
* **[Area Under the Receiver Operating Characteristic (AUROC)](https://en.wikipedia.org/wiki/Receiver_operating_characteristic)**

The metric set also includes the following classification metrics:

* **[F-1 Score](https://en.wikipedia.org/wiki/F-score)**
* **[Average Precision](https://scikit-learn.org/stable/modules/generated/sklearn.metrics.average_precision_score.html)**
* **[Specificity (Selectivity or True Negative Rate)](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Matthews Correlation Coefficient](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Accuracy](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Balanced Accuracy](https://en.wikipedia.org/wiki/Precision_and_recall)**
* **[Fowlkes-Mallows Index](https://en.wikipedia.org/wiki/Precision_and_recall)**

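To make the note on binarization concrete, here is a minimal sketch of turning raw probability scores into binary labels with a 0.5 cut-off (the threshold value is an illustrative choice, not a library default):

```python
import numpy as np

# Raw probability scores from a recommender, binarized at a 0.5 threshold.
y_score = np.array([0.1, 0.4, 0.8, 0.9])
y_pred = (y_score >= 0.5).astype(int)
print(y_pred)  # -> [0 0 1 1]
```
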
--------------------------------------------------------------------------------

**Coverage Metric Set**

These metrics measure how well the recommender system covers the available items in the catalog and the possible users; in other words, they measure the diversity of the predictions. A toy item coverage computation follows the list below.

* **[Item Coverage](https://www.bgu.ac.il/~shanigu/Publications/EvaluationMetrics.17.pdf)**
* **[User Coverage](https://www.bgu.ac.il/~shanigu/Publications/EvaluationMetrics.17.pdf)**

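As a minimal sketch (a plain re-implementation for illustration, not the library's code), item coverage can be computed as the fraction of catalog items that appear among the recommendations:

```python
def item_coverage(catalog_items, recommended_items):
    """Fraction of the item catalog that appears among the recommendations."""
    catalog = set(catalog_items)
    recommended = set(recommended_items)
    return len(recommended & catalog) / len(catalog)

# 3 of the 5 catalog items are ever recommended -> coverage of 0.6.
print(item_coverage([1, 2, 3, 4, 5], [1, 2, 2, 3]))
```
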
--------------------------------------------------------------------------------

**Documentation and Reporting Issues**

Head over to our [documentation](https://rexmex.readthedocs.io) to find out more about installation and data handling, a full list of implemented methods, and datasets.

If you notice anything unexpected, please open an [issue](https://github.com/AstraZeneca/rexmex/issues) and let us know. If you are missing a specific method, feel free to open a [feature request](https://github.com/AstraZeneca/rexmex/issues).
We are motivated to constantly make RexMex even better.

--------------------------------------------------------------------------------

**Installation via the command line**

RexMex can be installed with the following command after the repo is cloned.

```sh
$ pip install .
```

Use `-e/--editable` when developing.

**Installation via pip**

RexMex can be installed with the following pip command.

```sh
$ pip install rexmex
```

As we create new releases frequently, upgrading the package regularly can be beneficial.

```sh
$ pip install rexmex --upgrade
```

--------------------------------------------------------------------------------

**Running tests**

Tests can be run with `tox` as follows:

```sh
$ pip install tox
$ tox -e py
```

--------------------------------------------------------------------------------

**Citation**

If you use RexMex in a scientific publication, we would appreciate citations. Please see GitHub's built-in citation tool.

--------------------------------------------------------------------------------

**License**

- [Apache-2.0 License](https://github.com/AZ-AI/rexmex/blob/master/LICENSE)