Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kmohamedalie/apachespark-data_analytics
Data Analytics with Apache Spark ⭐
https://github.com/kmohamedalie/apachespark-data_analytics
apache-spark data-analytics data-engineering jupyter-notebook pyspark sparksql
Last synced: 3 days ago
JSON representation
Data Analytics with Apache Spark ⭐
- Host: GitHub
- URL: https://github.com/kmohamedalie/apachespark-data_analytics
- Owner: Kmohamedalie
- Created: 2024-06-11T08:15:59.000Z (5 months ago)
- Default Branch: master
- Last Pushed: 2024-06-11T09:06:03.000Z (5 months ago)
- Last Synced: 2024-10-19T03:12:19.841Z (about 1 month ago)
- Topics: apache-spark, data-analytics, data-engineering, jupyter-notebook, pyspark, sparksql
- Language: Jupyter Notebook
- Homepage:
- Size: 487 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Analytics with Apache Spark ⭐
1. Setting up spark locally using terminal
2. Running pyspark jobs on jupyterNotebook
3. Visualizing spark jobs
4. Running SQL and pandas like commands
### **Terminal and Spark dashboard:**
![image](https://github.com/Kmohamedalie/ApacheSpark-Data_Analytics/assets/63104472/b8d945d7-4f47-42ff-9123-69fc57f0842a)