An open API service indexing awesome lists of open source software.

https://github.com/vaibhavbansal26/data-pipeline-with-airflow

Data Pipeline With Airflow, AWS
https://github.com/vaibhavbansal26/data-pipeline-with-airflow

Last synced: about 2 months ago
JSON representation

Data Pipeline With Airflow, AWS

Awesome Lists containing this project

README

        

# Data-Pipeline-With-Airflow
Data Pipeline With Airflow, AWS

A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.

In this project, Need to create custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks on the data as the final step.

![Description](./airflow/images/flow.png)

Main file -> Airflow -> dags

Other file -> Airflow -> Plugins -> 1. helpers
2. operators

- Copying Data from S3

- Configuring Redshift

- Configuring Airflow

- Setting Up Airflow Connections with AWS

- Creating the Dag