Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/joyceannie/reddit_data_pipeline
This project builds a data pipeline that extracts data from the Reddit API and feeds a dashboard for analysing it. Posts are pulled daily from the subreddit r/Python, uploaded to S3 buckets, and then copied into Redshift. The dashboard is built with Google Data Studio.
airflow aws etl python redshift s3 terraform
Last synced: about 1 month ago
- Host: GitHub
- URL: https://github.com/joyceannie/reddit_data_pipeline
- Owner: joyceannie
- Created: 2022-08-16T22:28:20.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-08-27T16:34:00.000Z (over 2 years ago)
- Last Synced: 2023-03-05T11:33:09.257Z (almost 2 years ago)
- Topics: airflow, aws, etl, python, redshift, s3, terraform
- Language: Python
- Homepage:
- Size: 261 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
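
The extract, upload, and copy steps named in the project description can be illustrated with a minimal sketch. This is not the repository's code: the bucket name, table name, column list, key layout, and every function name below are hypothetical, chosen only to show how a daily extract might be serialized, stored in S3, and loaded with a Redshift `COPY`. The Reddit API call itself and the Airflow scheduling are left out.

```python
import csv
import io
from datetime import date

# Hypothetical names for illustration only; the repository's actual
# bucket, table, and columns are not stated in the description.
BUCKET = "example-reddit-bucket"
TABLE = "reddit_posts"
FIELDS = ["id", "title", "score", "num_comments", "created_utc"]


def s3_key_for(day: date) -> str:
    """Build one object key per daily extract, e.g. reddit/2022-08-27.csv."""
    return f"reddit/{day.isoformat()}.csv"


def posts_to_csv(posts: list[dict]) -> str:
    """Serialize extracted posts to CSV text, ignoring unknown fields."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(posts)
    return buffer.getvalue()


def copy_statement(day: date, iam_role: str) -> str:
    """Build a Redshift COPY statement that loads the day's CSV from S3."""
    return (
        f"COPY {TABLE} FROM 's3://{BUCKET}/{s3_key_for(day)}' "
        f"IAM_ROLE '{iam_role}' CSV IGNOREHEADER 1;"
    )


def upload_to_s3(day: date, csv_text: str) -> None:
    """Upload the CSV to S3 (needs boto3 and AWS credentials at call time)."""
    import boto3  # imported lazily so the pure helpers run without it

    boto3.client("s3").put_object(
        Bucket=BUCKET, Key=s3_key_for(day), Body=csv_text.encode("utf-8")
    )
```

In a scheduled run, each day's posts would pass through `posts_to_csv`, then `upload_to_s3`, and the string from `copy_statement` would be executed against the Redshift cluster. Keying objects by date keeps each daily load separate, so a failed day can be re-run without touching the others.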