An open API service indexing awesome lists of open source software.

https://github.com/kanasia-moore/sql-project


https://github.com/kanasia-moore/sql-project

bigquery data-analysis data-analysis-project data-analytics dataset homelessness sql

Last synced: 9 months ago
JSON representation

Awesome Lists containing this project

README

          

# BigQuery Data Exploration

## Overview

This project explores the public dataset using BigQuery and SQL queries to gain insights on the amount of homeless per state and the type and amount of shelters provided.

## Data

### Data Sources

This project uses the data collected from the BigQuery public dataset on homelessness:

https://console.cloud.google.com/bigquery?ws=!1m5!1m4!4m3!1sbigquery-public-data!2ssdoh_hud_pit_homelessness!3shud_pit_by_coc

### Data Description

Information about the dataset:
- **CoC** = Continuum of Care
- **Sheltered_ES** = Emergency Shelter
- **Sheltered_TH** = Transitional Housing
- **Sheltered_SH** = Safe Haven Housing
- The dataset ranges between 2012 and 2018

## Methodology

### Analysis Approach

- Previewed the table to understand key data such as:
- **Column Names:** Checked for descriptive columns and acronyms
- **Data types:** Made sure data stored was consistent with column data type
- **Completness:** Looked for missing or null values and checked for accuracy and reliability

- Created a table with the relevant data needed for analysis:
```sql
CREATE TABLE
Exploration_Project.homelessness AS
SELECT
CoC_Number,
LEFT(CoC_Number, 2) AS State,
CoC_Name,
Overall_Homeless,
Sheltered_ES_Homeless,
Sheltered_TH_Homeless,
Sheltered_SH_Homeless,
Sheltered_Total_Homeless,
Unsheltered_Homeless,
Homeless_Individuals,
Homeless_People_in_Families,
Chronically_Homeless,
Homeless_Veterans,
Homeless_Unaccompanied_Youth_Under_18,
Count_Year
FROM
`bigquery-public-data.sdoh_hud_pit_homelessness.hud_pit_by_coc`
```

### Tools and Technologies

- BigQuery
- SQL