https://github.com/chandkund/predicting-diabetes-onset
The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.
https://github.com/chandkund/predicting-diabetes-onset
deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization
Last synced: 3 months ago
JSON representation
The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.
- Host: GitHub
- URL: https://github.com/chandkund/predicting-diabetes-onset
- Owner: chandkund
- License: mit
- Created: 2024-09-01T19:42:37.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2025-01-10T18:10:27.000Z (over 1 year ago)
- Last Synced: 2025-07-17T08:32:53.197Z (12 months ago)
- Topics: deep-learning, numpy, pandas, python, seaborn, sklearn-library, sklearn-metrics, visualization
- Language: Jupyter Notebook
- Homepage:
- Size: 679 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Predicting Diabetes Onset ๐ฉบ๐ก
## Project Overview
This project focuses on predicting the onset of diabetes using various machine learning models. The goal is to develop a predictive model based on health-related features to determine the likelihood of diabetes.
## Dataset
The dataset includes the following features:
- **Pregnancies**: Number of pregnancies ๐คฐ
- **Glucose**: Plasma glucose concentration (2 hours in an oral glucose tolerance test) ๐ฉธ
- **BloodPressure**: Diastolic blood pressure (mm Hg) ๐
- **SkinThickness**: Triceps skin fold thickness (mm) ๐
- **Insulin**: 2-Hour serum insulin (mu U/ml) ๐งช
- **BMI**: Body mass index (weight in kg/(height in m)^2) โ๏ธ
- **DiabetesPedigreeFunction**: A measure of likelihood of diabetes based on family history.
- **Age**: Age of the individual (years) ๐
- **Outcome**: Whether the individual has diabetes (1) or not (0) โ
โ
## Models Used
- **GaussianNB**
- **DecisionTreeClassifier**
- **RandomForestClassifier**
- **LogisticRegression**
- **AdaBoostClassifier**
- **XGBClassifier**
- **SVC**
- **GradientBoostingClassifier**
- **Neural Network**
## Installation
To run this project, you need to install the following Python libraries:
- pandas ๐
- numpy ๐งฎ
- scikit-learn ๐ ๏ธ
- xgboost ๐
- tensorflow (for Neural Networks) ๐ง
You can install them using pip:
```bash
pip install pandas numpy scikit-learn xgboost tensorflow
```
## Usage
Clone the repository:
```bash
git clone https://github.com/chandkund/Predicting-Diabetes-Onset
```
## Navigate to the project directory:
```bash
cd Predicting-Diabetes-Onset
```
## Run the main script:
```bash
python main.py
```
Results
The project evaluates the performance of various models on the dataset, providing insights into which model performs best for predicting diabetes onset.
License
This project is licensed under the MIT License - see the [LICENSE](LICENSE)file for details.