https://github.com/harshcasper/vyaadhi
This repository consists of the various Jupyter Notebooks that were written to perform analysis on the different Open-Source Datasets available on Health Parameters and different disease, namely: Breast Cancer, Diabetes Analysis, Heart Disease, Kidney Disease and Liver Disease.
https://github.com/harshcasper/vyaadhi
classification disease health machine-learning machine-learning-algorithms
Last synced: 11 months ago
JSON representation
This repository consists of the various Jupyter Notebooks that were written to perform analysis on the different Open-Source Datasets available on Health Parameters and different disease, namely: Breast Cancer, Diabetes Analysis, Heart Disease, Kidney Disease and Liver Disease.
- Host: GitHub
- URL: https://github.com/harshcasper/vyaadhi
- Owner: HarshCasper
- License: mit
- Created: 2019-09-14T11:37:06.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2020-01-29T16:10:05.000Z (over 6 years ago)
- Last Synced: 2025-04-07T16:51:50.652Z (about 1 year ago)
- Topics: classification, disease, health, machine-learning, machine-learning-algorithms
- Language: Jupyter Notebook
- Homepage:
- Size: 10.4 MB
- Stars: 7
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Vyaadhi
[](https://www.codacy.com/manual/HarshCasper/Vyaadhi?utm_source=github.com&utm_medium=referral&utm_content=HarshCasper/Vyaadhi&utm_campaign=Badge_Grade)
This repository consists of the various Jupyter Notebooks that were written to perform analysis on the different Open-Source Datasets available on Health Parameters and different disease, namely: Breast Cancer, Diabetes Analysis, Heart Disease, Kidney Disease and Liver Disease.
## Accuracy
| | Logistic Regression | Naive Bayes | Support Vector Machines | Random Forest |
|----------------|---------------------|-------------|-------------------------|---------------|
| Breast Cancer | 96.4912% | 92.3977% | 95.91% | NaN |
| Liver Disease | 70.1149% | 53.4483% | 70.1149% | NaN |
| Diabetes | 78.355% | 76.1905% | 78.355% | NaN |
| Kidney Disease | 97.9167% | 100% | 100% | NaN |
| Heart Disease | 80.2198% | NaN | 81.32% | 91.21% |
## Datasets Used:
### Breast Cancer
The Breast Cancer Wisconsin (Diagnostic) Database available with sklearn was utilizes to create the dataset which has about 569 rows (cases) with 30 numeric features. The outcomes are either 1 - malignant, or 0 - benign.

### Liver Disease
The Liver Disease Database was utilized from an open-source [Kaggle](https://www.kaggle.com/uciml/indian-liver-patient-records) Database. The outcomes are two: Does the patient has liver disease or he does not have.

### Diabetes
An open source [Kaggle](https://www.kaggle.com/uciml/pima-indians-diabetes-database) was utilized for our Data Analysis and Machine Learning Processing.

### Kidney Disease
An open source [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/datasets/Chronic_Kidney_Disease) was utilized for our Data Analysis and Machine Learning Processing.

### Heart Disease
An open source [Kaggle Dataset](https://www.kaggle.com/ronitf/heart-disease-uci) was utilized for our Data Analysis and Machine Learning Processing.

## License
[MIT](https://choosealicense.com/licenses/mit/)