https://github.com/andrewjmack/home_sales
SparkSQL to determine key metrics about home sales data. Spark to create temporary views, partition the data, cache and uncache a temporary table, and verify that the table was uncached.
jupyter-notebook pandas python spark sql
- Host: GitHub
- URL: https://github.com/andrewjmack/home_sales
- Owner: andrewjmack
- Created: 2024-07-26T00:23:37.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-07-29T22:40:01.000Z (4 months ago)
- Last Synced: 2024-09-29T07:01:38.003Z (about 2 months ago)
- Topics: jupyter-notebook, pandas, python, spark, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 329 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Home_Sales
Univ of Denver: Data Analytics | July 2024 | Andrew Mack

### Table of Contents:
1. [Summary](#Summary)
2. [Contents](#Contents)
3. [Questions](#Questions)
4. [References](#References)

### Summary
The purpose of this project was to use Spark to read in a home sales data set, create temporary views and tables, and execute SQL queries to answer key questions with the data.
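As a rough illustration of the first step, the sketch below reads a CSV file and registers it as a temporary view. It is not taken from the notebook: the file path `home_sales.csv`, the view name `home_sales`, and the helper names `load_home_sales` and `main` are all assumptions made for this example.

```python
def load_home_sales(spark, csv_path, view_name="home_sales"):
    """Read a CSV into a DataFrame and register it as a temporary view.

    `spark` is an active SparkSession. The default view name is an
    assumption for this sketch, not a name confirmed by the notebook.
    """
    # header=True takes column names from the first row;
    # inferSchema=True asks Spark to guess column types from the data.
    df = spark.read.csv(csv_path, header=True, inferSchema=True)
    df.createOrReplaceTempView(view_name)
    return df


def main():
    # Imported here so the helper above can be read without pyspark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("HomeSales").getOrCreate()
    df = load_home_sales(spark, "home_sales.csv")  # hypothetical local path
    df.printSchema()
    spark.stop()
```

Calling `main()` in an environment with PySpark installed prints the inferred schema, which is a quick way to check the column names before writing queries against the view.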
### Contents
- Home_Sales.ipynb
- README.md
- png

This repository includes the Jupyter Notebook in which the data set was analyzed. The .CSV and partitioned Parquet files have been omitted from the repository for size considerations.
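The repository description also mentions partitioning the data and caching, uncaching, and verifying a temporary table. A minimal sketch of those steps follows. The partition column `date_built`, the `overwrite` write mode, the view name `home_sales`, and both helper names are assumptions made for this example, not details confirmed by the notebook.

```python
def write_partitioned(df, out_path, partition_col="date_built"):
    """Write `df` as Parquet, one sub-directory per value of `partition_col`.

    The partition column and the overwrite mode are assumptions.
    """
    df.write.partitionBy(partition_col).mode("overwrite").parquet(out_path)


def cache_then_uncache(spark, view_name="home_sales"):
    """Cache a temporary view, uncache it, and report the state after each step.

    Returns (was_cached, still_cached); (True, False) is the expected result.
    """
    spark.sql(f"CACHE TABLE {view_name}")
    was_cached = spark.catalog.isCached(view_name)

    spark.sql(f"UNCACHE TABLE {view_name}")
    still_cached = spark.catalog.isCached(view_name)

    return was_cached, still_cached
```

Checking `spark.catalog.isCached` after `UNCACHE TABLE` is what verifies that the table was actually released, rather than assuming the statement succeeded.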
### Questions
#### 1. What is the average price of a four-bedroom house sold in each year?
![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/1.png)
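One way to phrase this question in Spark SQL is sketched below. The view name `home_sales` and the column names `date`, `price`, and `bedrooms` are assumptions about the data set, and the query is not necessarily the one used in the notebook.

```python
# Assumed schema: `date` is the sale date, `price` the sale price,
# `bedrooms` the bedroom count; `home_sales` is the temporary view.
FOUR_BED_AVG_BY_YEAR_SOLD = """
SELECT YEAR(`date`) AS year_sold,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
WHERE bedrooms = 4
GROUP BY YEAR(`date`)
ORDER BY year_sold
"""


def avg_price_four_bed(spark):
    """Run the query on an active SparkSession and return the result DataFrame."""
    return spark.sql(FOUR_BED_AVG_BY_YEAR_SOLD)
```

With a session in hand, `avg_price_four_bed(spark).show()` prints one row per sale year.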
#### 2. What is the average price of a three-bedroom, three-bathroom home for each year the home was built?
![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/2.png)
#### 3. What is the average price of a home with three bedrooms, three bathrooms, two floors, and at least 2,000 square feet, for each year the home was built?
![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/3.png)
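Questions 2 and 3 share the same shape and differ only in their filters, so a single sketch can cover both. The column names `date_built`, `bathrooms`, `floors`, and `sqft_living`, the view name `home_sales`, and the helper names are assumptions for illustration. The notebook may write these as two separate queries.

```python
def year_built_query(floors=None, min_sqft=None):
    """Build the SQL text; optional filters are added only when given."""
    conditions = ["bedrooms = 3", "bathrooms = 3"]
    if floors is not None:
        conditions.append(f"floors = {int(floors)}")
    if min_sqft is not None:
        conditions.append(f"sqft_living >= {int(min_sqft)}")
    where = " AND ".join(conditions)
    return (
        "SELECT date_built, ROUND(AVG(price), 2) AS avg_price "
        "FROM home_sales "
        f"WHERE {where} "
        "GROUP BY date_built "
        "ORDER BY date_built"
    )


def avg_price_by_year_built(spark, floors=None, min_sqft=None):
    """Run the query on an active SparkSession and return the result DataFrame."""
    return spark.sql(year_built_query(floors, min_sqft))
```

Under these assumptions, `avg_price_by_year_built(spark)` corresponds to question 2 and `avg_price_by_year_built(spark, floors=2, min_sqft=2000)` to question 3.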
#### 4. What is the average price of a home for each "view" rating whose average home price is greater than or equal to $350,000?
![alt-text](https://github.com/andrewjmack/Home_Sales/blob/main/png/4.png)
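Because the threshold applies to the average of each group rather than to individual rows, this question calls for `HAVING` instead of `WHERE`. The sketch below assumes a view named `home_sales` with `view` and `price` columns. The descending sort order is also an assumption.

```python
# `view` is wrapped in backticks so it is read as a column name
# rather than as the SQL keyword.
AVG_PRICE_BY_VIEW = """
SELECT `view`,
       ROUND(AVG(price), 2) AS avg_price
FROM home_sales
GROUP BY `view`
HAVING AVG(price) >= 350000
ORDER BY `view` DESC
"""


def avg_price_by_view(spark):
    """Run the query on an active SparkSession and return the result DataFrame."""
    return spark.sql(AVG_PRICE_BY_VIEW)
```

`HAVING` is evaluated after grouping, so only view ratings whose average price meets the threshold appear in the output.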
### References
- The dataset was generated by edX Boot Camps LLC and is intended for educational purposes only.