https://github.com/nabojyoti/elt-ipl
This is an End-to-End Data Engineering Project that using the IPL Dataset.
https://github.com/nabojyoti/elt-ipl
apache-spark databricks-notebooks pyspark snowflake snowsql
Last synced: 5 months ago
JSON representation
This is an End-to-End Data Engineering Project that using the IPL Dataset.
- Host: GitHub
- URL: https://github.com/nabojyoti/elt-ipl
- Owner: nabojyoti
- Created: 2024-04-29T04:09:18.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-04-29T04:20:36.000Z (about 2 years ago)
- Last Synced: 2025-02-10T07:44:12.864Z (over 1 year ago)
- Topics: apache-spark, databricks-notebooks, pyspark, snowflake, snowsql
- Language: Jupyter Notebook
- Homepage:
- Size: 1.67 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# IPL Data Engineering
This is an End-to-End Data Engineering Project that using the IPL Dataset.
## Pipeline Architecture
[](https://postimg.cc/FfYZzvxb)
## Screenshots
### Snowflake Environment
[](https://postimg.cc/9zYNvWgz)
### Databricks Environment
[](https://postimg.cc/Z0wx80DP)
### Data Visualization
[](https://postimg.cc/644q4Hmm)
[](https://postimg.cc/s1GgcNTn)
## Tech Stack
**Dataset:** IPL-dataset from @data.world
**Extract & Load Data:** SnowSQL & Snowflake DB
**Transform Data:** Pyspark & Databricks