Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pronzzz/diabetes-prediction
Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset
data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn
Last synced: 22 days ago
- Host: GitHub
- URL: https://github.com/pronzzz/diabetes-prediction
- Owner: pronzzz
- Created: 2024-01-23T01:23:08.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-01-28T04:35:06.000Z (12 months ago)
- Last Synced: 2024-11-06T05:05:02.216Z (2 months ago)
- Topics: data-analysis, data-manipulation, data-preprocessing, data-visualization, knn, machine-learning, outlier-detection, seaborn
- Language: Jupyter Notebook
- Homepage:
- Size: 4.65 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## **Diabetes Prediction using Machine Learning**
- Pandas, NumPy, Matplotlib, and Seaborn are used for data manipulation and visualization
- Data Cleaning
  - Missing Values: No missing values
  - Duplicated Values: None found
- Exploratory Data Analysis
  - Outcome Distribution: Balanced
  - Outlier Detection: A few in some features
- Data Preprocessing
  - Standard Scaling: Applied for better model performance
  - Label Encoding: Outcome variable (0/1)
- Model Training and Evaluation
  - K-Nearest Neighbors Classifier
  - Grid Search for Optimal Hyperparameter (k)
  - Max Train Score: 94.23% at k = 14
  - Max Test Score: 88.89% at k = 13
- Performance Evaluation
  - Confusion Matrix: Shows model's prediction accuracy
  - Classification Report: Detailed metrics (Precision, Recall, F1-score)
- Conclusion
  - KNN Classifier with k = 13 provides the best balance between train and test scores
  - Model can effectively predict diabetes based on patient's features
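
The steps listed above (standard scaling, a KNN classifier, a sweep over k, and a confusion matrix) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the notebook's actual code: the data is a synthetic two-feature stand-in for the Pima dataset, and every function name (`make_data`, `standardize`, `knn_predict`, `accuracy`, `confusion_matrix`) is hypothetical. The notebook itself presumably relies on library implementations of these steps.

```python
import math
import random
from collections import Counter


def make_data(n, rng):
    """Synthetic stand-in for the real dataset (assumption): two features per row."""
    rows, labels = [], []
    for i in range(n):
        label = i % 2  # alternate classes so the outcome is balanced
        # Hypothetical glucose-like and BMI-like values, shifted upward for class 1
        rows.append([rng.gauss(100 + 40 * label, 20), rng.gauss(28 + 6 * label, 5)])
        labels.append(label)
    return rows, labels


def standardize(train, test):
    """Scale features to zero mean and unit variance using training statistics only."""
    cols = list(zip(*train))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]

    def scale(rows):
        return [[(v - m) / s for v, m, s in zip(r, means, stds)] for r in rows]

    return scale(train), scale(test)


def knn_predict(X_train, y_train, x, k):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    nearest = sorted(zip(X_train, y_train), key=lambda p: math.dist(p[0], x))[:k]
    # On a tied vote, the label that appears first among the nearest points wins
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(X_train, y_train, X, y, k):
    hits = sum(knn_predict(X_train, y_train, x, k) == t for x, t in zip(X, y))
    return hits / len(y)


def confusion_matrix(y_true, y_pred):
    """2x2 matrix indexed as matrix[true_label][predicted_label]."""
    matrix = [[0, 0], [0, 0]]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


rng = random.Random(0)
X_train, y_train = make_data(200, rng)
X_test, y_test = make_data(100, rng)
X_train, X_test = standardize(X_train, X_test)

# Sweep k and keep the value with the highest held-out accuracy
scores = {k: accuracy(X_train, y_train, X_test, y_test, k) for k in range(1, 16)}
best_k = max(scores, key=scores.get)

y_pred = [knn_predict(X_train, y_train, x, best_k) for x in X_test]
cm = confusion_matrix(y_test, y_pred)
print(f"best k = {best_k}, test accuracy = {scores[best_k]:.2%}")
print("confusion matrix [true][pred]:", cm)
```

The sketch selects k by the score on the held-out split, which appears to match how the scores above are reported. Choosing k on a separate validation split or with cross-validation would give a less optimistic estimate of test performance.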