{"id":28908022,"url":"https://github.com/djdurga/predictive_analysis_in_diabetes","last_synced_at":"2026-05-08T00:41:47.124Z","repository":{"id":299458306,"uuid":"1003116250","full_name":"Djdurga/Predictive_Analysis_in_Diabetes","owner":"Djdurga","description":"This project applies Logistic Regression to predict diabetes in patients using the Pima Indians Diabetes Dataset. It covers the full data science lifecycle — from data imputation and exploration to model training, evaluation, and insights.","archived":false,"fork":false,"pushed_at":"2025-06-16T16:49:27.000Z","size":249,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-06-16T17:49:27.640Z","etag":null,"topics":["matplotlib","numpy","pandas"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Djdurga.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-06-16T16:41:57.000Z","updated_at":"2025-06-16T16:52:59.000Z","dependencies_parsed_at":"2025-06-16T17:49:29.137Z","dependency_job_id":"8fa9d46b-d758-4546-a898-375fcafc6790","html_url":"https://github.com/Djdurga/Predictive_Analysis_in_Diabetes","commit_stats":null,"previous_names":["djdurga/predictive_analysis_in_diabetes"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Djdurga/Predictive_Analysis_in_Diabetes","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FPredictive_Analysis_in_Diabetes","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FPredictive_Analysis_in_Diabetes/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FPredictive_Analysis_in_Diabetes/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FPredictive_Analysis_in_Diabetes/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Djdurga","download_url":"https://codeload.github.com/Djdurga/Predictive_Analysis_in_Diabetes/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Djdurga%2FPredictive_Analysis_in_Diabetes/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":261153721,"owners_count":23116920,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["matplotlib","numpy","pandas"],"created_at":"2025-06-21T16:07:14.611Z","updated_at":"2026-05-08T00:41:47.112Z","avatar_url":"https://github.com/Djdurga.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🩺 Predictive Analysis in Diabetes using Logistic Regression\n\n## 📌 Project Overview\n\nThis project applies **Logistic Regression** to predict diabetes in patients using the **Pima Indians Diabetes Dataset**. It demonstrates the full data science workflow — from data imputation and EDA to model training, evaluation, and extracting insights.\n\n---\n\n## 🎯 Objective\n\nTo build a binary classification model that predicts whether a patient has diabetes (`Outcome: 1`) or not (`Outcome: 0`) using key health indicators such as glucose levels, BMI, insulin levels, and age.\n\n---\n\n## 🧪 Dataset Details\n\n- **Source**: [Kaggle - Pima Indians Diabetes Dataset](https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database)\n- **Total Records**: 768\n- **Target Feature**: `Outcome` (0 = No Diabetes, 1 = Diabetes)\n- **Attributes**:\n  - Pregnancies  \n  - Glucose  \n  - BloodPressure  \n  - SkinThickness  \n  - Insulin  \n  - BMI  \n  - DiabetesPedigreeFunction  \n  - Age  \n\n---\n\n## 📈 Workflow Summary\n\n1. **📥 Data Import \u0026 Exploration**\n   - Load dataset using pandas\n   - Check structure, shape, missing or zero values\n\n2. **🧹 Data Cleaning**\n   - Impute zero values in `Glucose`, `BloodPressure`, `Insulin`, `BMI`, etc.\n\n3. **📊 Exploratory Data Analysis (EDA)**\n   - Summary statistics\n   - Correlation matrix\n   - Visualizations with Seaborn \u0026 Matplotlib\n\n4. **⚙️ Model Building**\n   - Logistic Regression with Scikit-learn\n   - Train-test split\n\n5. **📉 Evaluation**\n   - Accuracy Score\n   - Confusion Matrix\n   - Precision, Recall, F1-Score\n   - ROC-AUC Curve\n\n---\n\n## 🛠 Tools \u0026 Technologies\n\n- Python\n- Pandas, NumPy\n- Matplotlib, Seaborn\n- Scikit-learn\n- Jupyter Notebook\n\n---\n\n## 📂 Repository Structure\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdjdurga%2Fpredictive_analysis_in_diabetes","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdjdurga%2Fpredictive_analysis_in_diabetes","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdjdurga%2Fpredictive_analysis_in_diabetes/lists"}