https://github.com/gakas14/formula1_data_modeling
This project involves getting different datasets, creating a data model, creating tables, creating relationships between tables, and then loading the data into Snowflake.
https://github.com/gakas14/formula1_data_modeling
data-engineering data-modeling entity-relationship-diagram pandas-dataframe python snowflake sql
Last synced: about 2 months ago
JSON representation
This project involves getting different datasets, creating a data model, creating tables, creating relationships between tables, and then loading the data into Snowflake.
- Host: GitHub
- URL: https://github.com/gakas14/formula1_data_modeling
- Owner: gakas14
- Created: 2024-03-08T08:20:16.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-12T03:41:40.000Z (over 2 years ago)
- Last Synced: 2024-12-31T12:34:04.264Z (over 1 year ago)
- Topics: data-engineering, data-modeling, entity-relationship-diagram, pandas-dataframe, python, snowflake, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 422 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Formula1_Data_Modeling
### In this project, we have a Formula 1 dataset covering races, drivers, constructors, qualifying, circuits, lap times, pit stops, and championships from 1950 to the 2023 season. The data needs to be loaded into a Snowflake database.
#### 1. We first load the different datasets into Python and then process some transformations.
#### 2. We create a connection between Python and Snowflake and create a database.

#### 3. We create a Snowflake table using Python.

#### 4. We Build a data model.

#### 5. We load the different data frames into a Snowflake.
