https://github.com/mikekeith52/fsu_sql
SQL demo for the Florida State Masters of Applied Economics program.
https://github.com/mikekeith52/fsu_sql
Last synced: 3 months ago
JSON representation
SQL demo for the Florida State Masters of Applied Economics program.
- Host: GitHub
- URL: https://github.com/mikekeith52/fsu_sql
- Owner: mikekeith52
- Created: 2020-04-18T00:09:51.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2024-10-06T21:23:05.000Z (9 months ago)
- Last Synced: 2025-02-09T08:17:17.982Z (5 months ago)
- Homepage:
- Size: 19 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# FSU_SQL
Welcome FSU students! Please follow the instructions below to start learning Data Science and SQL:
## Download all needed files from GitHub
1. Click the green code button on the top-right
2. Download all files as a zip
3. Unzip and save the files locally to your computer where you'll be able to easily access them## Create a Databricks Account
Please complete the steps below before our session begins.1. Navigate to https://community.cloud.databricks.com
2. Sign up for a free community account
- Select the option to sign up a new account
- Fill out the form
- For Company, use "Florida State University"
- For Title, use "Student"
- In the next screen select "Get Started with Community Edition"
- Once you confirm your email, you should be able to log into the Databricks platform## Create a Databricks Cluster and SQL Instance
We will follow these instructions together when I arrive on campus. You are free to do all this before I arrive, but because of how Databricks is as a platform, you will need to repeat these steps when we are together.1. Navigate to https://community.cloud.databricks.com
2. Log in
3. Create a compute cluster
- On your account dashboard, click the "Compute" icon on the left
- Select "Create compute" on the top-right
- Use "FSU Cluster" for the compute name
- Any option for the Databricks Runtime Option will work (safest is to use the default one offered to you)
- Create Compute
4. Upload cogsley_clients.csv and cogsley_sales.csv to the default database
- Select the main databricks logo on the top-left
- Select "Create table"
- Upload cogsley_sales.csv
- Create table with UI
- Attach the FSU Cluster
- Preview table
- Change name to cogsley_sales
- Column delimeter is comma (,)
- Check thse boxes:
- First row is header
- Multi-line
- Scroll through the preview data and change all date columns to type "Timestamp" (OrderDate, OrderMonthYear, ProjectCompleteDate)
- Create table
- Upload cogsley_clients.csv and repeate the steps for the last dataset. This time, call the table cogsley_clients. You can also check the "Infer schema" box as there are no columns to change to Timestamp
5. Upload SQL-Practice-FSU.dbc as a notebook
- Select the main databricks logo on the top-left
- Create a new notebook
- Select file--> import
- Select the QL-Practice-FSU.dbc
- On the file icon on the left, select the notebook
6. We will go through each coding block together to thoroughly cover what each is accomplishing## Questions?
Feel free to contact me
- [email protected]
- [LinkedIn](https://www.linkedin.com/in/michaelwkeith/)
- [GitHub](https://github.com/mikekeith52)