https://github.com/armahdavi/bigdata_pyspark_sales_analytics

A summary of my big data code in Python (PySpark) for analyzing retail and Walmart superstore sales data to draw sales insights.

big-data bigquery clustering dataframe hadoop k-means machine-learning pyspark pyspark-ml python spark unsupervised-learning

# Big Data: PySpark and H2O.ai

This repository collects my Big Data work in Sales Analytics, including Machine Learning modeling, using two widely used Big Data platforms available from Python: PySpark (the Python API for Apache Spark) and H2O.ai.