An open API service indexing awesome lists of open source software.

https://github.com/joyceannie/reddit_data_pipeline

The purpose of the project is to create a data pipeline to extract data from Reddit API and create a dashboard to analyse the data. The data is extracted from the subreddit r/Python. The data is extracted daily and uploaded to S3 buckets, and copied to Redshift. The dashboard is created using Google Data Studio.
https://github.com/joyceannie/reddit_data_pipeline

airflow aws etl python redshift s3 terraform

Last synced: 4 months ago
JSON representation

The purpose of the project is to create a data pipeline to extract data from Reddit API and create a dashboard to analyse the data. The data is extracted from the subreddit r/Python. The data is extracted daily and uploaded to S3 buckets, and copied to Redshift. The dashboard is created using Google Data Studio.

Awesome Lists containing this project