https://github.com/armahdavi/bigdata_pyspark_sales_analytics
Summarizing my big data code in python pyspark to analyze sales data with retail and walmart superstore to draw sales insights
https://github.com/armahdavi/bigdata_pyspark_sales_analytics
big-data bigquery clustering dataframe hadoop k-means machine-learning pyspark pyspark-ml python spark unsupervised-learning
Last synced: 2 months ago
JSON representation
Summarizing my big data code in python pyspark to analyze sales data with retail and walmart superstore to draw sales insights
- Host: GitHub
- URL: https://github.com/armahdavi/bigdata_pyspark_sales_analytics
- Owner: armahdavi
- Created: 2024-12-25T19:27:27.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-12-25T21:18:04.000Z (over 1 year ago)
- Last Synced: 2025-02-19T02:14:47.848Z (over 1 year ago)
- Topics: big-data, bigquery, clustering, dataframe, hadoop, k-means, machine-learning, pyspark, pyspark-ml, python, spark, unsupervised-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 41 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Big Data: PySpark and H2O.ai
This repository is a collection of my Big Data work in Sales Analytics (including Machine Learning modeling) using the two most famous Big Data platforms in Python: PySpark (the Spark version of Python) and H2O.ai.