An open API service indexing awesome lists of open source software.

https://github.com/ralstonraphael/water_access_ml_project

This project analyzes life expectancy data from the World Health Organization (WHO), sourced from Kaggle. The dataset spans 183 countries across 6 regions, covering metrics such as life expectancy, mortality rates, healthcare access, and socioeconomic factors.
https://github.com/ralstonraphael/water_access_ml_project

data-science machine-learning numpy pandas python

Last synced: 7 months ago
JSON representation

This project analyzes life expectancy data from the World Health Organization (WHO), sourced from Kaggle. The dataset spans 183 countries across 6 regions, covering metrics such as life expectancy, mortality rates, healthcare access, and socioeconomic factors.

Awesome Lists containing this project

README

          

# ๐ŸŒ Life Expectancy Analysis with WHO Data

## ๐Ÿ“„ Project Overview
This project explores and analyzes life expectancy data provided by the World Health Organization (sourced from Kaggle). Using Python's powerful data science ecosystem, we clean, transform, and visualize insights about global health trends. We then apply machine learning techniques to predict life expectancy based on socioeconomic and health factors.

---

## ๐Ÿงช Tech Stack
- Python
- Pandas, Numpy
- Matplotlib, Seaborn
- Scikit-learn
- TensorFlow/Keras

---

## ๐Ÿง  Key Learnings
- Mastery in data preprocessing: handling missing values, normalization, and encoding.
- Feature engineering to enhance model inputs.
- Exploratory Data Analysis (EDA) with Matplotlib and Seaborn.
- Training and evaluating machine learning models using TensorFlow/Keras.
- Interpretation of results to derive health policy insights.

---

## ๐Ÿ” Key Insights
- **GDP and schooling** are strongly positively correlated with higher life expectancy.
- **HIV/AIDS prevalence** shows a strong negative correlation.
- Countries with better **healthcare expenditure** generally enjoy higher life expectancy.
- Model performance indicates socioeconomic indicators can be good predictors of life expectancy, though regional anomalies exist.

---

## ๐Ÿงพ How to Run

```bash
# Install required libraries
pip install pandas numpy matplotlib seaborn scikit-learn tensorflow