https://github.com/martachesnova/big-data
Investigates whether reviews from Amazon's Vine program are trustworthy. The ETL process was performed in the cloud, and the resulting DataFrame was uploaded to an RDS instance. PySpark and Spark SQL were then used to run a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
- Host: GitHub
- URL: https://github.com/martachesnova/big-data
- Owner: martachesnova
- Created: 2021-11-08T23:09:44.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-11-19T09:02:30.000Z (over 4 years ago)
- Last Synced: 2025-01-06T10:13:38.904Z (over 1 year ago)
- Topics: big-data, data-analysis, dataset, python, spark, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 178 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
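
The description mentions a statistical analysis of Vine reviews but does not say which statistic was used. One plausible comparison is the share of five-star reviews among Vine and non-Vine reviews. The sketch below illustrates that idea in plain Python so it runs without a Spark cluster. The field names `vine` and `star_rating`, the sample rows, and the helper `five_star_share` are all assumptions made for illustration; none of them is taken from the repository.

```python
from collections import defaultdict

# Hypothetical sample rows. The field names `vine` and `star_rating`
# are assumptions, not confirmed by the repository.
reviews = [
    {"vine": "Y", "star_rating": 5},
    {"vine": "Y", "star_rating": 3},
    {"vine": "N", "star_rating": 5},
    {"vine": "N", "star_rating": 4},
    {"vine": "N", "star_rating": 2},
    {"vine": "N", "star_rating": 1},
]


def five_star_share(rows):
    """Return {vine_flag: (total_reviews, five_star_reviews, percent_five_star)}."""
    totals = defaultdict(int)
    five_star = defaultdict(int)
    for row in rows:
        totals[row["vine"]] += 1
        if row["star_rating"] == 5:
            five_star[row["vine"]] += 1
    return {
        flag: (totals[flag], five_star[flag], 100.0 * five_star[flag] / totals[flag])
        for flag in totals
    }


shares = five_star_share(reviews)
for flag, (total, fives, pct) in sorted(shares.items()):
    print(f"vine={flag}: {fives}/{total} five-star ({pct:.1f}%)")
```

In a PySpark workflow the same grouping would typically be expressed as a grouped aggregation over the DataFrame. A large gap between the two percentages would suggest bias, although a real analysis would also need to account for sample size.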