Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/trilokida/classification_metrics


https://github.com/trilokida/classification_metrics

Last synced: 4 days ago
JSON representation

Awesome Lists containing this project

README

        

# Classification_metrics

**Dataset** can be download from https://www.kaggle.com/uciml/pima-indians-diabetes-database
* *Pregnancies:- Number of times pregnant*
* *Glucose:- Plasma glucose concentration a 2 hours in an oral glucose tolerance test*
* *BloodPressure:- Diastolic blood pressure (mm Hg)*
* *SkinThickness:- Triceps skin fold thickness (mm)*
* *Insulin:- 2-Hour serum insulin (mu U/ml)*
* *BMI:- Body mass index (weight in kg/(height in m)^2)*
* *DiabetesPedigreeFunction:- Diabetes pedigree function*
* *Age:- Age (years)*
* *Outcome:- Class variable (0 or 1)*

## Highlight some basic points with code like:
* Classification accuracy: percentage of correct predictions
* Null accuracy
* Comparing the true and predicted response values
* **Confusion matrix**
* **Metrics computed from a confusion matrix**
* **Classification Accuracy:** Overall, how often is the classifier correct
* **Classification Error:** Overall, how often is the classifier incorrect?
* **Sensitivity:** When the actual value is positive, how often is the prediction correct?
* **Specificity:** When the actual value is negative, how often is the prediction correct?
* **False Positive Rate:** When the actual value is negative, how often is the prediction incorrect?
* **Precision:** When a positive value is predicted, how often is the prediction correct?
* **Adjusting the classification threshold**
* **ROC Curves and Area Under the Curve (AUC)**
* AUC is the percentage of the ROC plot that is underneath the curve.