Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mikeacosta/data-lake-spark
Data lake ETL pipeline in Apache Spark
https://github.com/mikeacosta/data-lake-spark
apache-spark aws emr s3
Last synced: 19 days ago
JSON representation
Data lake ETL pipeline in Apache Spark
- Host: GitHub
- URL: https://github.com/mikeacosta/data-lake-spark
- Owner: mikeacosta
- Created: 2020-01-27T08:14:52.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2020-01-27T08:24:42.000Z (almost 5 years ago)
- Last Synced: 2023-04-03T20:58:08.827Z (over 1 year ago)
- Topics: apache-spark, aws, emr, s3
- Language: Python
- Homepage:
- Size: 109 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0