https://github.com/arnoldchrisoduor1/tokyo-olympics-data-engineering-pipeline-azure
End-to-end data engineering pipeline using Microsoft Azure.
apache-spark azure-data-factory azure-datalake azure-storage azure-synapse databricks python sql
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/arnoldchrisoduor1/tokyo-olympics-data-engineering-pipeline-azure
- Owner: arnoldchrisoduor1
- Created: 2024-07-18T12:41:56.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-07-18T17:58:57.000Z (6 months ago)
- Last Synced: 2024-09-29T07:02:17.954Z (3 months ago)
- Topics: apache-spark, azure-data-factory, azure-datalake, azure-storage, azure-synapse, databricks, python, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 478 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Tokyo Olympics Data Engineering Pipeline in Azure
![Screenshot 2024-07-18 201559](https://github.com/user-attachments/assets/a8fc909f-65f2-44a9-8e9f-87804d428ca8)
## The picture shows how data moved through the pipeline
- The source data came from this githubrepo/data and was ingested by Azure Data Factory.

The aim of the project was to build the data pipeline itself, not necessarily the data transformations in the notebook.
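
As a rough local illustration of the flow described above (source files ingested first, then transformed in a notebook), here is a minimal Python sketch that uses only the standard library. It is not the project's code: the `raw-data` and `transformed-data` folder names, the `medals.csv` file name, the function names, and the sample rows are all hypothetical placeholders, and a temporary directory stands in for the data lake.

```python
import csv
import tempfile
from pathlib import Path

# Hypothetical placeholder rows; the real dataset's columns are not shown here.
SAMPLE_CSV = "Team,Gold,Silver\nTeam A,2,1\nTeam B,0,3\n"


def ingest(csv_text: str, lake: Path) -> Path:
    """Land the source file unchanged in a raw zone (the ingestion step)."""
    raw_path = lake / "raw-data" / "medals.csv"  # assumed layout
    raw_path.parent.mkdir(parents=True, exist_ok=True)
    raw_path.write_text(csv_text, encoding="utf-8")
    return raw_path


def transform(raw_path: Path, lake: Path) -> Path:
    """Validate numeric columns and write the result to a transformed zone."""
    with raw_path.open(newline="", encoding="utf-8") as f:
        rows = [
            # int() raises ValueError if a medal count is not numeric.
            {"Team": r["Team"], "Gold": int(r["Gold"]), "Silver": int(r["Silver"])}
            for r in csv.DictReader(f)
        ]
    out_path = lake / "transformed-data" / "medals.csv"  # assumed layout
    out_path.parent.mkdir(parents=True, exist_ok=True)
    with out_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["Team", "Gold", "Silver"])
        writer.writeheader()
        writer.writerows(rows)
    return out_path


def run_pipeline(csv_text: str) -> list[dict]:
    """Run ingest then transform, and read back the transformed rows."""
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)  # temporary directory standing in for the data lake
        out_path = transform(ingest(csv_text, lake), lake)
        with out_path.open(newline="", encoding="utf-8") as f:
            return list(csv.DictReader(f))


if __name__ == "__main__":
    for row in run_pipeline(SAMPLE_CSV):
        print(row)
```

Keeping the raw copy separate from the transformed copy means the transformation step can be rerun or changed without ingesting the source again, which fits the project's focus on the pipeline rather than on the transformations.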
Follow me @ `arnold0duor`