Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: about 20 hours ago
JSON representation
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
- Host: GitHub
- URL: https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
- Owner: tathithienthanh
- Created: 2024-04-07T05:03:53.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-06-10T17:42:35.000Z (8 months ago)
- Last Synced: 2024-12-09T22:50:15.353Z (about 2 months ago)
- Topics: blood-test, classification, clustering, data-analysis, data-processing, decision-tree, diabetes-prediction, diagnosis, exercise, google-colab, health, hierarchical, ipynb, kmeans, knn, knns, py, python, smote-sampling, visualization
- Language: Jupyter Notebook
- Homepage:
- Size: 13.4 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# DataAnalysis_Diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result# Citation
The data were collected from the Iraqi society, as they data were acquired from the laboratory of Medical City Hospital and (the Specializes Center for Endocrinology and Diabetes-Al-Kindy Teaching Hospital).Please add this citation if you use this dataset for any further analysis.
*Rashid, Ahlam. Diabetes Dataset. PlumX Metrics, 2020.*
Link: https://plu.mx/plum/a?mendeley_data_id=wj9rwkp9c2&theme=plum-bigben-theme
# Detailed Information about the dataset
* https://www.kaggle.com/datasets/aravindpcoder/diabetes-dataset
* https://www.kaggle.com/datasets/simaanjali/diabetes-classification-dataset/data# About the report and code
* Used techniques: preprocessing, SMOTE, clustering with K-Means and Hierarchical, classification with KNNs and Decision Tree
* Re-edit the path if you use our code for importing or loading the dataset.
* The report is for reference only, please do not edit or use for other purposes.*All the files are done by me and @phamcongthuan, if you reuse the code please add the citation*