https://github.com/san089/yelp_project
This project creates a data lake for the Yelp dataset, then uses it to build an analytical sandbox for data science and a data warehouse for reporting.
- Host: GitHub
- URL: https://github.com/san089/yelp_project
- Owner: san089
- License: gpl-3.0
- Created: 2018-12-28T05:40:57.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-07-26T05:30:25.000Z (about 6 years ago)
- Last Synced: 2025-01-17T04:46:04.538Z (9 months ago)
- Topics: data-lake, data-pipeline, etl, etl-pipeline, ingestion, load, pyspark, recommender-system, redshift
- Language: Jupyter Notebook
- Size: 351 KB
- Stars: 2
- Watchers: 2
- Forks: 2
- Open Issues: 0
Metadata Files:
- License: LICENSE