https://github.com/y-t-g/food-prices-eda
Exploratory data analysis of food prices in Malaysia using PySpark
- Host: GitHub
- URL: https://github.com/y-t-g/food-prices-eda
- Owner: Y-T-G
- Created: 2021-11-11T16:37:38.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-11-19T09:48:20.000Z (over 3 years ago)
- Last Synced: 2025-02-08T21:26:15.121Z (4 months ago)
- Language: Jupyter Notebook
- Size: 126 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Food Prices EDA
[Open in Colab](https://colab.research.google.com/github/Y-T-G/Food-Prices-EDA/blob/master/food_prices_eda.ipynb)
This notebook carries out an exploratory data analysis (EDA) of producer food prices in Malaysia using the [dataset](https://data.humdata.org/dataset/8147f617-b323-40ea-a03f-5c21db2c62bb/resource/69b1b7b8-701a-44bf-816b-6a07b78fbc77) provided by the Humanitarian Data Exchange (HDX). The dataset covers Malaysian producer food prices recorded over almost three decades. Apache Spark (through the PySpark interface) was used for the data wrangling, and Seaborn was used to create the visualizations.
# Resources
Dataset - [Producer Prices data for Malaysia](https://data.humdata.org/dataset/8147f617-b323-40ea-a03f-5c21db2c62bb/resource/69b1b7b8-701a-44bf-816b-6a07b78fbc77)