https://github.com/andrewjmack/home_sales
Uses SparkSQL to determine key metrics about home sales data, and Spark to create temporary views, partition the data, cache and uncache a temporary table, and verify that the table was uncached.
jupyter-notebook pandas python spark sql
- Host: GitHub
- URL: https://github.com/andrewjmack/home_sales
- Owner: andrewjmack
- Created: 2024-07-26T00:23:37.000Z
- Default Branch: main
- Last Pushed: 2024-07-29T22:40:01.000Z
- Last Synced: 2025-03-06T17:15:41.044Z
- Topics: jupyter-notebook, pandas, python, spark, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 329 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Home_Sales
University of Denver: Data Analytics | July 2024 | Andrew Mack
### Table of Contents:
1. [Summary](#Summary)
2. [Contents](#Contents)
3. [Questions](#Questions)
4. [References](#References)
### Summary
The purpose of this project was to use Spark to read in a home sales data set, create temporary views and tables, and execute SQL queries to answer key questions about the data.
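
As a rough illustration of the loading step, the sketch below reads a CSV file into a DataFrame and registers it as a temporary view so it can be queried with SQL. The helper name `load_home_sales`, the view name `home_sales`, and the CSV options are assumptions made for this example; the notebook may do this differently.

```python
def load_home_sales(spark, csv_path, view_name="home_sales"):
    """Read a CSV file and expose it to SparkSQL as a temporary view.

    `spark` is an existing SparkSession. The view name and the CSV
    options (header row, inferred schema) are assumptions.
    """
    df = spark.read.csv(csv_path, header=True, inferSchema=True)
    # Register the DataFrame so it can be referenced by name in spark.sql(...)
    df.createOrReplaceTempView(view_name)
    return df
```

With a real session this could be called as `load_home_sales(SparkSession.builder.getOrCreate(), "path/to/home_sales.csv")`, where the path is a placeholder.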
### Contents
- Home_Sales.ipynb
- README.md
- png
This repository includes the Jupyter Notebook in which the data set was analyzed. The source CSV file and the partitioned Parquet files are omitted from the repository because of their size.
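
Because the Parquet output is not in the repository, here is a minimal sketch of how partitioned Parquet files could be written and how a temporary table could be cached, checked, and uncached. The partition column `date_built`, the view name `home_sales`, and both helper names are assumptions, not details confirmed by this README.

```python
def write_partitioned(df, parquet_path, partition_col="date_built"):
    """Write a DataFrame as Parquet, one sub-directory per partition value.

    The partition column is an assumption for this sketch.
    """
    df.write.partitionBy(partition_col).mode("overwrite").parquet(parquet_path)


def cache_then_uncache(spark, view_name="home_sales"):
    """Cache a temporary table, uncache it, and report the state after each step."""
    spark.sql(f"CACHE TABLE {view_name}")
    was_cached = spark.catalog.isCached(view_name)
    spark.sql(f"UNCACHE TABLE {view_name}")
    still_cached = spark.catalog.isCached(view_name)
    # A successful round trip is expected to give (True, False)
    return was_cached, still_cached
```

Checking `spark.catalog.isCached` after the `UNCACHE TABLE` statement is one way to verify that the table is no longer cached.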
### Questions
#### 1. What is the average price for a four-bedroom house sold for each year?
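
One way this question could be expressed in SparkSQL is sketched below. The view name `home_sales` and the column names `date`, `price`, and `bedrooms` are assumptions about the data set, and the two-decimal rounding is a presentation choice.

```python
# Assumptions (not confirmed by the README): the temporary view is named
# "home_sales" and has "date", "price", and "bedrooms" columns.
AVG_PRICE_FOUR_BED_SQL = """
SELECT YEAR(date) AS year_sold,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
WHERE bedrooms = 4
GROUP BY YEAR(date)
ORDER BY year_sold
"""


def avg_price_four_bed(spark):
    """Run the query on an existing SparkSession and return a DataFrame."""
    return spark.sql(AVG_PRICE_FOUR_BED_SQL)
```

Calling `avg_price_four_bed(spark).show()` would print one row per sale year.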

#### 2. What is the average price of a home with three bedrooms and three bathrooms, for each year the home was built?
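
A possible SparkSQL formulation is sketched below, grouping by build year after filtering on bedrooms and bathrooms. The column names `date_built`, `price`, `bedrooms`, and `bathrooms` and the view name `home_sales` are assumptions.

```python
# Assumed view and column names; adjust to the actual schema.
AVG_PRICE_3BED_3BATH_SQL = """
SELECT date_built,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
WHERE bedrooms = 3
  AND bathrooms = 3
GROUP BY date_built
ORDER BY date_built
"""


def avg_price_3bed_3bath(spark):
    """Return average price per build year for 3-bed, 3-bath homes."""
    return spark.sql(AVG_PRICE_3BED_3BATH_SQL)
```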

#### 3. What is the average price of a home with three bedrooms, three bathrooms, two floors, and at least 2,000 square feet, for each year the home was built?
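
The extra conditions on floors and living area can be added as further filters, as in the sketch below. The column names `floors` and `sqft_living`, along with the other column names and the view name `home_sales`, are assumptions about the schema.

```python
# Assumed view and column names; "sqft_living" in particular is a guess
# at which column holds the home's square footage.
AVG_PRICE_3BED_3BATH_2FLOOR_SQL = """
SELECT date_built,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
WHERE bedrooms = 3
  AND bathrooms = 3
  AND floors = 2
  AND sqft_living >= 2000
GROUP BY date_built
ORDER BY date_built
"""


def avg_price_3bed_3bath_2floor(spark):
    """Return average price per build year for the filtered homes."""
    return spark.sql(AVG_PRICE_3BED_3BATH_2FLOOR_SQL)
```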

#### 4. What is the average price of a home per "view" rating, for ratings whose average home price is greater than or equal to $350,000?
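
Since the threshold applies to the group average rather than to individual sales, a `HAVING` clause is a natural fit. The sketch below assumes a view named `home_sales` with `view` and `price` columns; the descending sort order is an arbitrary choice, and the column name is backtick-quoted because `view` is also an SQL keyword.

```python
# Assumed view and column names. HAVING filters on the aggregated value,
# whereas WHERE would filter individual rows before grouping.
AVG_PRICE_PER_VIEW_SQL = """
SELECT `view`,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
GROUP BY `view`
HAVING AVG(price) >= 350000
ORDER BY `view` DESC
"""


def avg_price_per_view(spark):
    """Return average price per view rating where the average is >= 350,000."""
    return spark.sql(AVG_PRICE_PER_VIEW_SQL)
```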

### References
- The dataset was generated by edX Boot Camps LLC and is intended for educational purposes only.