Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/najuzilu/dl-spark
Building a Data Lake with Spark
https://github.com/najuzilu/dl-spark
aws-emr aws-s3 data-engineering data-lake etl-pipeline spark
Last synced: 18 days ago
JSON representation
Building a Data Lake with Spark
- Host: GitHub
- URL: https://github.com/najuzilu/dl-spark
- Owner: najuzilu
- License: mit
- Created: 2021-07-29T00:48:30.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-08-01T21:30:40.000Z (over 3 years ago)
- Last Synced: 2024-11-28T11:15:00.768Z (3 months ago)
- Topics: aws-emr, aws-s3, data-engineering, data-lake, etl-pipeline, spark
- Language: Python
- Homepage:
- Size: 894 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0