An open API service indexing awesome lists of open source software.

https://github.com/storopoli/udacity-compas

Udacity's ML Engineer nanodegree capstone project - COMPAS Fair Classifier trained/tuned/deployed in AWS SageMaker
https://github.com/storopoli/udacity-compas

aws binary-classification classification fairness fairness-ml gradient-boosting machine-learning sagemaker xgboost

Last synced: 7 months ago
JSON representation

Udacity's ML Engineer nanodegree capstone project - COMPAS Fair Classifier trained/tuned/deployed in AWS SageMaker

Awesome Lists containing this project

README

          

# COMPAS Fair Classifier

This is a Capstone Project for the Udacity's [Machine Learning Engineer nanodegree](https://www.udacity.com/course/machine-learning-engineer-nanodegree--nd009t). The goal is to **train/tune/deploy a fair binary classifier** for recidivism using [COMPAS data](https://github.com/propublica/compas-analysis).

## COMPAS 2016 Scandal

In May 2016, [ProPublica published a report](https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing) regarding a judicial decision support algorithm that outputs a risk score for defendant redicivism. This algorithm is called COMPAS, short for *Correctional Offender Management Profiling for Alternative Sanctions*. It has been shown that COMPAS is extremely biased toward african-american offenders when compared to caucasian offenders for the same prior/post offenses.

## Model

The model employed was [SageMaker's `XGBoost`](https://docs.aws.amazon.com/sagemaker/latest/dg/xgboost.html) tunned to maximize MAP (mean average precision) in order to deal with original COMPAS' model unbalanced false positive rates.

## Results

### Original COMPAS (Dressel & Farid, 2018)

**Accuracy**: 60.6%

| | **African-American** | **Caucasian** |
| ----------------------- | -------------------- | ------------- |
| **False Positive Rate** | 40.4% | 25.4% |

### Proposed Model

**Accuracy**: 100%

| | **African-American** | **Caucasian** |
| ----------------------- | -------------------- | ------------- |
| **False Positive Rate** | 0% | 0% |

## Author

Jose Storopoli, PhD - [ORCID](https://orcid.org/0000-0002-0559-5176) - [CV](https://storopoli.github.io)

[thestoropoli@gmail.com](mailto:thestoropoli@gmail.com)

## References

Dressel, J., & Farid, H. (2018). The accuracy, fairness, and limits of predicting recidivism. *Science advances*, *4*(1), eaao5580. https://doi.org/10.1126/sciadv.aao5580