https://github.com/andrewjmack/home_sales

Uses Spark SQL to determine key metrics about home sales data, and Spark to create temporary views, partition the data, cache and uncache a temporary table, and verify that the table was uncached.
jupyter-notebook pandas python spark sql

# Home_Sales
Univ of Denver: Data Analytics | July 2024 | Andrew Mack

### Table of Contents:

1. [Summary](#Summary)
2. [Contents](#Contents)
3. [Questions](#Questions)
4. [References](#References)

### Summary

The purpose of this project was to use Spark to read in a home sales data set, create temporary views and tables, and execute SQL queries that answer key questions about the data.
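As a rough illustration of that workflow, the sketch below reads a CSV into a DataFrame, registers it as a temporary view, and runs one SQL query against it. It is not taken from the notebook: the function names, the `csv_path` argument, and the view name `home_sales` are hypothetical, and `spark` is assumed to be an existing `pyspark.sql.SparkSession`.

```python
def load_home_sales(spark, csv_path, view_name="home_sales"):
    """Read the CSV into a DataFrame and expose it to Spark SQL as a temporary view.

    `spark` is assumed to be an existing pyspark.sql.SparkSession;
    `csv_path` and `view_name` are hypothetical values for this sketch.
    """
    # header=True takes column names from the first row;
    # inferSchema=True asks Spark to infer column types from the data.
    df = spark.read.csv(csv_path, header=True, inferSchema=True)
    # Register the DataFrame so it can be referenced by name in SQL.
    df.createOrReplaceTempView(view_name)
    return df


def row_count(spark, view_name="home_sales"):
    """Run a simple SQL query against the temporary view and return the count."""
    rows = spark.sql(f"SELECT COUNT(*) AS n FROM {view_name}").collect()
    return rows[0]["n"]
```

With PySpark installed, a session could be created with `SparkSession.builder.appName("home_sales").getOrCreate()` and passed in as `spark`.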

### Contents

- Home_Sales.ipynb
- README.md
- png

This repository includes the Jupyter Notebook in which the data set was analyzed. The source CSV file and the partitioned Parquet files have been omitted from the repository to keep its size down.
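Since the Parquet files are not in the repository, here is a minimal sketch of how partitioned Parquet output and the cache/uncache check might be produced with PySpark. The helper names, the output path, and the partition column are assumptions for illustration; the README does not say which column the notebook partitions on.

```python
def write_partitioned(df, parquet_path, partition_col):
    """Write the DataFrame as Parquet, one sub-directory per partition value.

    `partition_col` is a hypothetical column name supplied by the caller.
    """
    df.write.partitionBy(partition_col).mode("overwrite").parquet(parquet_path)


def cache_then_uncache(spark, view_name):
    """Cache a temporary view, release it, and report the cache state after each step."""
    spark.sql(f"CACHE TABLE {view_name}")
    cached_after_cache = spark.catalog.isCached(view_name)
    spark.sql(f"UNCACHE TABLE {view_name}")
    # Expected to be False once the table has been uncached.
    cached_after_uncache = spark.catalog.isCached(view_name)
    return cached_after_cache, cached_after_uncache
```

Checking `spark.catalog.isCached` after `UNCACHE TABLE` is one way to verify that the table was actually released from memory.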

### Questions

#### 1. What is the average price of a four-bedroom house sold in each year?

![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/1.png)

#### 2. What is the average price of a home with three bedrooms and three bathrooms, for each year the home was built?

![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/2.png)

#### 3. What is the average price of a home with three bedrooms, three bathrooms, two floors, and at least 2,000 square feet, for each year the home was built?

![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/3.png)

#### 4. What is the average price of a home per "view" rating, for ratings whose average home price is greater than or equal to $350,000?

![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/4.png)
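
Question 4 filters on an aggregate, so the condition belongs in `HAVING` rather than `WHERE`. The sketch below shows one possible shape for that query. The column names `view` and `price` and the view name `home_sales` are assumptions, since the README does not list the schema. To keep the example runnable without a Spark installation, it checks the query against made-up rows in Python's built-in `sqlite3` as a stand-in engine; with a Spark session, the same string could be passed to `spark.sql(QUERY)`.

```python
import sqlite3

# Assumed names: a `home_sales` view with `view` and `price` columns.
# Backticks quote `view` because it is also a SQL keyword.
QUERY = """
SELECT `view`, ROUND(AVG(price), 2) AS avg_price
FROM home_sales
GROUP BY `view`
HAVING AVG(price) >= 350000
ORDER BY `view` DESC
"""


def run_on_sample(rows):
    """Run QUERY against (view, price) rows loaded into an in-memory SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE home_sales (`view` INTEGER, price REAL)")
    conn.executemany("INSERT INTO home_sales VALUES (?, ?)", rows)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


# Made-up rows for illustration only; they are not taken from the real data set.
sample = [(1, 300000), (1, 320000), (50, 340000), (50, 400000), (99, 1000000)]
print(run_on_sample(sample))
```

In the sample, rating 1 averages below the threshold and is dropped, while ratings 50 and 99 are kept.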

### References

- The dataset was generated by edX Boot Camps LLC and is intended for educational purposes only.