https://github.com/wlopezm-unal/reddit_project_airflow_aws
This project focuses on implementing an ETL pipeline using Apache Airflow to efficiently extract data from Reddit, transform it as needed, and load it into an AWS S3 bucket. The use of Airflow allows for robust orchestration of the data workflow, ensuring that each step of the ETL process is executed in a reliable and repeatable manner.
https://github.com/wlopezm-unal/reddit_project_airflow_aws
airflow airflow-dag airflow-docker aws docker-compose git python terraform
Last synced: 3 months ago
JSON representation
This project focuses on implementing an ETL pipeline using Apache Airflow to efficiently extract data from Reddit, transform it as needed, and load it into an AWS S3 bucket. The use of Airflow allows for robust orchestration of the data workflow, ensuring that each step of the ETL process is executed in a reliable and repeatable manner.
- Host: GitHub
- URL: https://github.com/wlopezm-unal/reddit_project_airflow_aws
- Owner: wlopezm-unal
- Created: 2024-10-29T02:05:03.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-30T02:03:26.000Z (over 1 year ago)
- Last Synced: 2025-10-12T01:09:47.941Z (9 months ago)
- Topics: airflow, airflow-dag, airflow-docker, aws, docker-compose, git, python, terraform
- Language: Python
- Homepage:
- Size: 38.1 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md