An open API service indexing awesome lists of open source software.

https://github.com/mikekeith52/fsu_sql

SQL demo for the Florida State Masters of Applied Economics program.
https://github.com/mikekeith52/fsu_sql

Last synced: 3 months ago
JSON representation

SQL demo for the Florida State Masters of Applied Economics program.

Awesome Lists containing this project

README

        

# FSU_SQL

Welcome FSU students! Please follow the instructions below to start learning Data Science and SQL:

## Download all needed files from GitHub
1. Click the green code button on the top-right
2. Download all files as a zip
3. Unzip and save the files locally to your computer where you'll be able to easily access them

## Create a Databricks Account
Please complete the steps below before our session begins.

1. Navigate to https://community.cloud.databricks.com
2. Sign up for a free community account
- Select the option to sign up a new account
- Fill out the form
- For Company, use "Florida State University"
- For Title, use "Student"
- In the next screen select "Get Started with Community Edition"
- Once you confirm your email, you should be able to log into the Databricks platform

## Create a Databricks Cluster and SQL Instance
We will follow these instructions together when I arrive on campus. You are free to do all this before I arrive, but because of how Databricks is as a platform, you will need to repeat these steps when we are together.

1. Navigate to https://community.cloud.databricks.com
2. Log in
3. Create a compute cluster
- On your account dashboard, click the "Compute" icon on the left
- Select "Create compute" on the top-right
- Use "FSU Cluster" for the compute name
- Any option for the Databricks Runtime Option will work (safest is to use the default one offered to you)
- Create Compute
4. Upload cogsley_clients.csv and cogsley_sales.csv to the default database
- Select the main databricks logo on the top-left
- Select "Create table"
- Upload cogsley_sales.csv
- Create table with UI
- Attach the FSU Cluster
- Preview table
- Change name to cogsley_sales
- Column delimeter is comma (,)
- Check thse boxes:
- First row is header
- Multi-line
- Scroll through the preview data and change all date columns to type "Timestamp" (OrderDate, OrderMonthYear, ProjectCompleteDate)
- Create table
- Upload cogsley_clients.csv and repeate the steps for the last dataset. This time, call the table cogsley_clients. You can also check the "Infer schema" box as there are no columns to change to Timestamp
5. Upload SQL-Practice-FSU.dbc as a notebook
- Select the main databricks logo on the top-left
- Create a new notebook
- Select file--> import
- Select the QL-Practice-FSU.dbc
- On the file icon on the left, select the notebook
6. We will go through each coding block together to thoroughly cover what each is accomplishing

## Questions?
Feel free to contact me
- [email protected]
- [LinkedIn](https://www.linkedin.com/in/michaelwkeith/)
- [GitHub](https://github.com/mikekeith52)