https://github.com/wlopezm-unal/reddit_project_airflow_aws

This project implements an ETL pipeline with Apache Airflow: it extracts data from Reddit, transforms it as needed, and loads the result into an AWS S3 bucket. Airflow orchestrates the workflow so that each step of the ETL process runs reliably and repeatably.
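
The extract, transform, and load stages described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the repository's actual code: the field list, the function names, and the `fetch_posts` and `upload` callables are all hypothetical stand-ins for a Reddit client and an S3 upload.

```python
import csv
import io
from datetime import datetime, timezone

# Hypothetical field selection; the project's real schema is not shown in the source.
FIELDS = ["id", "title", "score", "num_comments", "created_utc"]


def extract(fetch_posts):
    """Extract: pull raw post dicts from a caller-supplied fetch function."""
    return list(fetch_posts())


def transform(posts):
    """Transform: keep selected fields and convert epoch seconds to ISO 8601 UTC."""
    rows = []
    for post in posts:
        row = {name: post.get(name) for name in FIELDS}
        row["created_utc"] = datetime.fromtimestamp(
            post["created_utc"], tz=timezone.utc
        ).isoformat()
        rows.append(row)
    return rows


def load(rows, upload):
    """Load: serialize rows to CSV text and pass it to an upload callable."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    upload(buffer.getvalue())


def run_pipeline(fetch_posts, upload):
    """Run the three stages in order, as an orchestrator would."""
    load(transform(extract(fetch_posts)), upload)
```

In an Airflow deployment, each stage would typically be wrapped in its own task so that failures and retries are tracked per step, and `upload` could be a small function that calls the boto3 S3 client's `put_object` with a bucket name and key of your choosing.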
airflow airflow-dag airflow-docker aws docker-compose git python terraform

Last synced: 3 months ago