
https://github.com/scikit-learn-contrib/skope-rules

machine learning with logical rules in Python


Host: GitHub
URL: https://github.com/scikit-learn-contrib/skope-rules
Owner: scikit-learn-contrib
License: other
Created: 2018-02-18T13:42:47.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2024-01-31T14:01:51.000Z (5 months ago)
Last Synced: 2024-04-25T10:02:47.271Z (about 2 months ago)
Language: Jupyter Notebook
Homepage: http://skope-rules.readthedocs.io
Size: 5.62 MB
Stars: 587
Watchers: 26
Forks: 94
Open Issues: 35
Metadata Files:
- Readme: README.md
- License: COPYING
- Authors: AUTHORS.rst

Lists

awesome-machine-learning-interpretability - skope-rules - "a Python machine learning module built on top of scikit-learn and distributed under the 3-Clause BSD license." (Technical Resources / Open Source/Access Responsible AI Software Packages)
Awesome-explainable-AI - https://github.com/scikit-learn-contrib/skope-rules (Python Libraries(sort in alphabeta order) / Evaluation methods)
awesome_quantmetry - skope-rules (Interpretability / Explainable AI)
awesome-python-machine-learning-resources - GitHub (Sklearn utilities)
my-awesome-stars - scikit-learn-contrib/skope-rules - machine learning with logical rules in Python (Jupyter Notebook)

README

.. -*- mode: rst -*-

|Travis|_  |Coveralls|_ |CircleCI|_ |Python27|_ |Python35|_

.. |Travis| image:: https://api.travis-ci.org/skope-rules/skope-rules.svg?branch=master

.. _Travis: https://travis-ci.org/skope-rules/skope-rules

.. |Coveralls| image:: https://coveralls.io/repos/github/skope-rules/skope-rules/badge.svg?branch=master

.. _Coveralls: https://coveralls.io/github/skope-rules/skope-rules?branch=master

.. |CircleCI| image:: https://circleci.com/gh/skope-rules/skope-rules/tree/master.svg?style=shield&circle-token=:circle-token

.. _CircleCI: https://circleci.com/gh/skope-rules/skope-rules

.. |Python27| image:: https://img.shields.io/badge/python-2.7-blue.svg

.. _Python27: https://badge.fury.io/py/skope-rules

.. |Python35| image:: https://img.shields.io/badge/python-3.5-blue.svg

.. _Python35: https://badge.fury.io/py/skope-rules

.. image:: logo.png

skope-rules
===========

Skope-rules is a Python machine learning module built on top of
scikit-learn and distributed under the 3-Clause BSD license.

Skope-rules aims at learning logical, interpretable rules for "scoping" a target
class, i.e. detecting instances of this class with high precision.

Skope-rules is a trade-off between the interpretability of a Decision Tree
and the modeling power of a Random Forest.

See the ``AUTHORS.rst`` file for a list of contributors.

.. image:: schema.png

Installation
------------

You can install the latest release with pip::

    pip install skope-rules

Quick Start
------------

SkopeRules can be used to describe classes with logical rules:

.. code:: python

    from sklearn.datasets import load_iris
    from skrules import SkopeRules

    dataset = load_iris()
    feature_names = ['sepal_length', 'sepal_width', 'petal_length', 'petal_width']
    clf = SkopeRules(max_depth_duplication=2,
                     n_estimators=30,
                     precision_min=0.3,
                     recall_min=0.1,
                     feature_names=feature_names)

    # Fit one binary (one-vs-rest) model per species and print its first three rules
    for idx, species in enumerate(dataset.target_names):
        X, y = dataset.data, dataset.target
        clf.fit(X, y == idx)
        rules = clf.rules_[0:3]
        print("Rules for iris", species)
        for rule in rules:
            print(rule)
        print()
        print(20*'=')
        print()

SkopeRules can also be used as a predictor if you use the ``score_top_rules`` method:

.. code:: python

    # Note: load_boston was removed in scikit-learn 1.2, so this example
    # needs an older scikit-learn release to run as written.
    from sklearn.datasets import load_boston
    from sklearn.metrics import precision_recall_curve
    from matplotlib import pyplot as plt
    from skrules import SkopeRules

    dataset = load_boston()
    clf = SkopeRules(max_depth_duplication=None,
                     n_estimators=30,
                     precision_min=0.2,
                     recall_min=0.01,
                     feature_names=dataset.feature_names)

    # Binary target: is the target value above 25?
    X, y = dataset.data, dataset.target > 25
    # Use the first half of the data for training and the second half for testing
    X_train, y_train = X[:len(y)//2], y[:len(y)//2]
    X_test, y_test = X[len(y)//2:], y[len(y)//2:]
    clf.fit(X_train, y_train)
    y_score = clf.score_top_rules(X_test) # Get a risk score for each test example
    precision, recall, _ = precision_recall_curve(y_test, y_score)
    plt.plot(recall, precision)
    plt.xlabel('Recall')
    plt.ylabel('Precision')
    plt.title('Precision Recall curve')
    plt.show()

For more examples and use cases, please check our `documentation <http://skope-rules.readthedocs.io>`_.

You can also check the demonstration notebooks in the ``notebooks/`` folder.

Links with existing literature
-------------------------------

The main advantage of decision rules is that they offer interpretable models. The problem of generating such rules has been widely studied in machine learning; see e.g. RuleFit [1], Slipper [2], LRI [3], MLRules [4].

A decision rule is a logical expression of the form "IF conditions THEN response". In a binary classification setting, if an instance satisfies the conditions of the rule, it is assigned to one of the two classes. If it does not satisfy the conditions, it remains unassigned.
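
As a minimal illustration of this definition, the sketch below represents a rule as a conjunction of threshold conditions and returns ``None`` for unassigned instances. The ``apply_rule`` helper, the tuple format, and the thresholds are hypothetical and are not part of the skope-rules API.

.. code:: python

    import operator

    # Hypothetical rule format: a list of (feature, operator, threshold) conditions.
    OPS = {"<=": operator.le, ">": operator.gt}

    def apply_rule(conditions, instance, response=1):
        """Return `response` if every condition holds, else None (unassigned)."""
        for feature, op, threshold in conditions:
            if not OPS[op](instance[feature], threshold):
                return None
        return response

    # IF petal_length > 2.5 AND petal_width <= 1.7 THEN class 1
    rule = [("petal_length", ">", 2.5), ("petal_width", "<=", 1.7)]
    flower_a = {"petal_length": 4.0, "petal_width": 1.3}
    flower_b = {"petal_length": 1.4, "petal_width": 0.2}

    print(apply_rule(rule, flower_a))  # satisfies both conditions
    print(apply_rule(rule, flower_b))  # fails the first condition, stays unassigned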

1) In [2, 3, 4], rule induction is done by considering each single decision rule as a base classifier in an ensemble, which is built by greedily minimizing some loss function.

2) In [1], rules are extracted from an ensemble of trees; a weighted combination of these rules is then built by solving an L1-regularized optimization problem over the weights, as described in [5].
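
To make the tree-based extraction of the second approach concrete, here is a small sketch in which every root-to-leaf path of a tree becomes one conjunctive rule. The nested-dict tree format and the ``extract_rules`` helper are assumptions made for illustration; they do not reflect scikit-learn's internal tree structure, and a real implementation may also emit rules for interior nodes.

.. code:: python

    # Toy tree as nested dicts (hypothetical format).
    tree = {
        "feature": "petal_length", "threshold": 2.5,
        "left": {"leaf": "setosa"},
        "right": {
            "feature": "petal_width", "threshold": 1.7,
            "left": {"leaf": "versicolor"},
            "right": {"leaf": "virginica"},
        },
    }

    def extract_rules(node, path=()):
        """Yield (rule_string, leaf_label) for every root-to-leaf path."""
        if "leaf" in node:
            yield " and ".join(path), node["leaf"]
            return
        name, thr = node["feature"], node["threshold"]
        # Left branch: the split condition holds; right branch: it does not.
        yield from extract_rules(node["left"], path + (f"{name} <= {thr}",))
        yield from extract_rules(node["right"], path + (f"{name} > {thr}",))

    rules = list(extract_rules(tree))
    for rule, label in rules:
        print(f"IF {rule} THEN {label}")

Applying the same traversal to every tree of an ensemble yields the pool of candidate rules.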

In this package, we use the second approach. Rules are extracted from a tree ensemble, which allows us to take advantage of existing fast algorithms (such as bagged decision trees or gradient boosting) to produce such an ensemble. Rules that are duplicated or too similar are then removed, based on a similarity threshold on their supports.
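
One way to picture the removal of near-duplicate rules is the sketch below. It assumes that the support of a rule is the set of sample indices it covers and that similarity is measured with the Jaccard index; both the measure and the ``deduplicate`` helper are illustrative assumptions, not the package's actual implementation.

.. code:: python

    def jaccard(a, b):
        """Jaccard similarity of two sets of sample indices."""
        union = a | b
        return len(a & b) / len(union) if union else 1.0

    def deduplicate(rules, threshold=0.9):
        """Keep the most precise rule of each group of near-identical supports.

        rules: list of (rule_string, precision, support_set) tuples.
        """
        kept = []
        # Visit rules from most to least precise, so the best one is kept first.
        for rule in sorted(rules, key=lambda r: r[1], reverse=True):
            if all(jaccard(rule[2], k[2]) < threshold for k in kept):
                kept.append(rule)
        return kept

    rules = [
        ("petal_length > 2.5", 0.80, {3, 4, 5, 6, 7}),
        ("petal_length > 2.45", 0.82, {3, 4, 5, 6, 7}),  # same support as above
        ("sepal_width <= 3.0", 0.60, {0, 1, 5}),
    ]
    kept = deduplicate(rules)
    print([r[0] for r in kept])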

The main goal of this package is to provide rules that satisfy precision and recall conditions. It still implements a scoring method (``decision_function``), but this method does not solve the L1-regularized optimization problem as in [1]. Instead, the weights are simply proportional to the out-of-bag (OOB) precision of each rule.
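
The following sketch shows how such a selection and scoring could work under simple assumptions: rules are first filtered by minimum precision and recall, and a sample's score is the sum of the precisions of the selected rules that fire on it. The record format, the numbers, and the ``decision_score`` helper are hypothetical.

.. code:: python

    # Hypothetical rule records: (name, OOB precision, OOB recall).
    rules = [
        ("r1", 0.75, 0.30),
        ("r2", 0.25, 0.50),   # fails the precision condition
        ("r3", 0.625, 0.05),  # fails the recall condition
        ("r4", 0.50, 0.20),
    ]
    precision_min, recall_min = 0.3, 0.1

    # Keep only the rules that satisfy both conditions.
    selected = [r for r in rules if r[1] >= precision_min and r[2] >= recall_min]

    def decision_score(fires, selected_rules):
        """Sum the precisions of the selected rules that fire on one sample.

        fires: one boolean per selected rule, True if the rule covers the sample.
        """
        return sum(prec for fired, (_, prec, _) in zip(fires, selected_rules) if fired)

    print([name for name, _, _ in selected])
    print(decision_score([True, True], selected))   # both selected rules fire
    print(decision_score([False, True], selected))  # only the second one fires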

This package also offers convenient methods to compute predictions with the k most precise rules (see the ``score_top_rules()`` and ``predict_top_rules()`` methods).
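
As a rough sketch of predicting with the k most precise rules, the example below flags a sample as positive when at least one of the top-k rules fires on it. This is one plausible reading of the idea; the ``top_k_rules`` and ``predict_with_top_k`` helpers, the predicates, and the precisions are all made up for illustration and are not the package's methods.

.. code:: python

    # Hypothetical rules: (predicate on a sample, precision).
    rules = [
        (lambda s: s["sepal_length"] > 5.0, 0.40),
        (lambda s: s["petal_width"] > 1.7, 0.95),
        (lambda s: s["petal_length"] > 4.9, 0.90),
    ]

    def top_k_rules(rules, k):
        """Return the k rules with the highest precision."""
        return sorted(rules, key=lambda r: r[1], reverse=True)[:k]

    def predict_with_top_k(rules, sample, k):
        """Return 1 if any of the k most precise rules fires on the sample, else 0."""
        return int(any(predicate(sample) for predicate, _ in top_k_rules(rules, k)))

    sample = {"sepal_length": 5.8, "petal_length": 4.0, "petal_width": 1.2}
    print(predict_with_top_k(rules, sample, k=2))  # only the two most precise rules
    print(predict_with_top_k(rules, sample, k=3))  # also includes the weakest rule

A smaller k trades recall for precision, since fewer and more reliable rules are allowed to fire.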

[1] Friedman and Popescu, Predictive learning via rule ensembles, Technical Report, 2005.

[2] Cohen and Singer, A simple, fast, and effective rule learner, National Conference on Artificial Intelligence, 1999.

[3] Weiss and Indurkhya, Lightweight rule induction, ICML, 2000.

[4] Dembczyński, Kotłowski and Słowiński, Maximum Likelihood Rule Ensembles, ICML, 2008.

[5] Friedman and Popescu, Gradient directed regularization, Technical Report, 2004.

Dependencies
------------

skope-rules requires:

- Python (>= 2.7 or >= 3.3)

- NumPy (>= 1.10.4)

- SciPy (>= 0.17.0)

- Pandas (>= 0.18.1)

- Scikit-Learn (>= 0.17.1)

For running the examples Matplotlib >= 1.1.1 is required.

    

Documentation
--------------

You can access the full project documentation `here <http://skope-rules.readthedocs.io>`_.

You can also check the ``notebooks/`` folder, which contains usage examples.