Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset
https://github.com/pronzzz/diabetes-prediction

Last synced: 14 days ago
JSON representation

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

Awesome Lists containing this project

README

        

## **ο»ΏπŸ€– Diabetes Prediction using Machine Learning πŸ“Š**

- Pandas, NumPy, Matplotlib, and Seaborn are used for data manipulation and visualization πŸ“ˆ

- Data Cleaning 🧹
- Missing Values: No missing values 😊
- Duplicated Values: None found πŸ‘

- Exploratory Data Analysis πŸ”Ž
- Outcome Distribution: Balanced βš–οΈ
- Outlier Detection: A few in some features πŸ‘€

- Data Preprocessing πŸ› οΈ
- Standard Scaling: Applied for better model performance 🎚️
- Label Encoding: Outcome variable (0/1) 🎯

- Model Training and Evaluation πŸš‚
- K-Nearest Neighbors Classifier 🀝
- Grid Search for Optimal Hyperparameter (k) πŸ”§
- Max Train Score: 94.23% at k = 14 πŸ₯‡
- Max Test Score: 88.89% at k = 13 πŸ†

- Performance Evaluation πŸ“Š
- Confusion Matrix: Shows model's prediction accuracy β¬œβ¬›
- Classification Report: Detailed metrics (Precision, Recall, F1-score) πŸ‘Œ

- Conclusion 🏁
- KNN Classifier with k = 13 provides the best balance between train and test scores πŸ†
- Model can effectively predict diabetes based on patient's features πŸ˜„ο»Ώ