An open API service indexing awesome lists of open source software.

https://github.com/y-t-g/food-prices-eda

Exploratory data analysis of food prices in Malaysia using PySpark
https://github.com/y-t-g/food-prices-eda

Last synced: 3 months ago
JSON representation

Exploratory data analysis of food prices in Malaysia using PySpark

Awesome Lists containing this project

README

        

# Food Prices EDA

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Y-T-G/Food-Prices-EDA/blob/master/food_prices_eda.ipynb)

This notebook aims to carry out an exploratory data analysis of producer food prices in Malaysia using the [dataset](https://data.humdata.org/dataset/8147f617-b323-40ea-a03f-5c21db2c62bb/resource/69b1b7b8-701a-44bf-816b-6a07b78fbc77) provided by The Humanitarian Data Exchange (HDX). It contains data about the Malaysian producer food prices recorded over almost three decades. Apache Spark (through the PySpark interface) was used to carry out the required data-wrangling while Seaborn was used to create the visualizations.

# Resources

Dataset - [Producer Prices data for Malaysia](https://data.humdata.org/dataset/8147f617-b323-40ea-a03f-5c21db2c62bb/resource/69b1b7b8-701a-44bf-816b-6a07b78fbc77)