https://github.com/alien-droid/demographic-analysis
- Host: GitHub
- URL: https://github.com/alien-droid/demographic-analysis
- Owner: alien-droid
- Created: 2023-05-06T13:30:35.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-05-07T17:00:39.000Z (about 3 years ago)
- Last Synced: 2025-06-17T04:06:08.607Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 120 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Demographic-Analysis
To read the dataset, place the `.csv` file (downloaded from Kaggle) in a `data` folder inside the `resources` folder, i.e. under `resources/data/`.
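
As a minimal sketch of that layout (not the notebook's actual code), the snippet below looks for a `.csv` file under `resources/data/` and reads it with PySpark. The helper names `find_dataset_csv` and `load_dataset` and the app name are hypothetical, and it assumes the CSV has a header row.

```python
from pathlib import Path


def find_dataset_csv(project_root="."):
    """Return the first .csv file found in <project_root>/resources/data."""
    data_dir = Path(project_root) / "resources" / "data"
    csv_files = sorted(data_dir.glob("*.csv"))
    if not csv_files:
        raise FileNotFoundError(f"No .csv file found in {data_dir}")
    return csv_files[0]


def load_dataset(csv_path):
    """Read the CSV into a Spark DataFrame (requires pyspark)."""
    # Imported here so the path helper above works without Spark installed.
    from pyspark.sql import SparkSession

    # The app name is an arbitrary label chosen for this sketch.
    spark = SparkSession.builder.appName("demographic-analysis").getOrCreate()
    # header=True uses the first row as column names (assumed for this CSV);
    # inferSchema=True lets Spark guess column types with an extra pass over the data.
    return spark.read.csv(str(csv_path), header=True, inferSchema=True)
```

A typical call would be `df = load_dataset(find_dataset_csv())`, followed by `df.printSchema()` to check the inferred column types.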
## Problem Statement
This study analyzes a sample of the population using Big Data technologies in the demographics domain. It explores a dataset of deceased famous individuals, extracted from Wikipedia, that contains information about each individual's work, age, and death.
## Analysis and PySpark
The analysis was conducted using PySpark, the Python API for Apache Spark, a distributed computing framework for large-scale data. After a few preprocessing steps, Exploratory Data Analysis (EDA) was performed to gain insight into the distributions of the various predictors in the dataset.
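
The preprocessing and EDA steps are not spelled out here, so the following is only an illustrative sketch of what they could look like in PySpark. The column names `Gender` and `Age of death` are assumptions about the Kaggle CSV, and the helper names `preprocess` and `age_summary_by` are hypothetical.

```python
def preprocess(df, age_col="Age of death"):
    """Cast the age column to integer and drop rows where it is missing."""
    # Imported lazily so this module can be loaded without Spark installed.
    from pyspark.sql import functions as F

    return (
        df.withColumn(age_col, F.col(age_col).cast("int"))  # non-numeric values become null
          .dropna(subset=[age_col])
    )


def age_summary_by(df, group_col="Gender", age_col="Age of death"):
    """Count rows and average the age per group, largest groups first."""
    from pyspark.sql import functions as F

    return (
        df.groupBy(group_col)
          .agg(F.count("*").alias("count"), F.avg(age_col).alias("mean_age"))
          .orderBy(F.desc("count"))
    )
```

Calling `age_summary_by(preprocess(df)).show()` prints one row per group. Passing a different `group_col` gives the same summary for another predictor, provided that column exists in the data.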
## Dataset
[Kaggle](https://www.kaggle.com/datasets/imoore/age-dataset)