https://github.com/soham7998/ipl-data-analysis_using-apache-spark
- Host: GitHub
- URL: https://github.com/soham7998/ipl-data-analysis_using-apache-spark
- Owner: soham7998
- Created: 2024-06-08T04:34:26.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-06-08T04:35:49.000Z (12 months ago)
- Last Synced: 2024-06-08T05:34:33.288Z (12 months ago)
- Language: Jupyter Notebook
- Size: 2.59 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# IPL-Data-Analysis_Using Apache-Spark
Here are the things I have done.
• Basics of Apache Spark (architecture, transformations, actions, lazy evaluation)
• Creating a Databricks account and learning its basics
• The Structured API and how to write transformation functions
• Using SQL to analyze IPL data
• Building visualizations to gain more insights
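
To make the transformation / action / lazy evaluation idea from the list above concrete, here is a minimal sketch in plain Python. It is not Spark code and not taken from the notebook: `LazyPipeline`, `double`, and the sample `runs` values are hypothetical names and data invented for illustration. It only mimics the behaviour that transformations are recorded and nothing runs until an action is called.

```python
# Plain-Python illustration only; this is NOT the Spark API.
class LazyPipeline:
    """Records transformations and executes them only when an action runs."""

    def __init__(self, rows, steps=()):
        self._rows = rows
        self._steps = tuple(steps)

    def map(self, fn):
        # Transformation: returns a new pipeline, performs no work yet.
        return LazyPipeline(self._rows, self._steps + (("map", fn),))

    def filter(self, pred):
        # Transformation: also just recorded for later.
        return LazyPipeline(self._rows, self._steps + (("filter", pred),))

    def _run(self):
        # Applies every recorded step to each row, in order.
        for row in self._rows:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    row = fn(row)
                elif not fn(row):
                    keep = False
                    break
            if keep:
                yield row

    def collect(self):
        # Action: triggers execution and returns all resulting rows.
        return list(self._run())

    def count(self):
        # Action: triggers execution and returns the number of rows.
        return sum(1 for _ in self._run())


calls = []  # tracks when the mapped function is actually invoked


def double(x):
    calls.append(x)
    return x * 2


runs = [0, 1, 4, 6, 2]  # hypothetical runs scored on five balls
pipeline = LazyPipeline(runs).map(double).filter(lambda r: r >= 4)
work_before_action = len(calls)  # still 0: only the plan exists so far
result = pipeline.collect()      # the action runs the recorded steps
```

In Spark the same split applies: calls such as `filter` or `select` build up a plan, and an action such as `count()` or `collect()` is what starts the computation.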
The goal of this project is to give you an overall understanding of Apache Spark and the functions it provides for writing transformation blocks. On top of that, you will learn to use SQL to analyze data and build visualizations.
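
As a rough idea of the kind of SQL analysis meant here, the sketch below runs an aggregate query with Python's built-in `sqlite3` module as a stand-in for Spark SQL, so it can be tried without a cluster. The table name `ball_by_ball`, its columns, and the rows are all hypothetical; the actual IPL dataset schema is not described in this README.

```python
import sqlite3

# Stand-in for Spark SQL: an in-memory SQLite database.
# Table, columns, and rows are made up for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE ball_by_ball (match_id INTEGER, batsman TEXT, runs_scored INTEGER)"
)
conn.executemany(
    "INSERT INTO ball_by_ball VALUES (?, ?, ?)",
    [
        (1, "Player A", 4),
        (1, "Player A", 6),
        (1, "Player B", 1),
        (2, "Player B", 2),
        (2, "Player A", 0),
        (2, "Player C", 6),
    ],
)

# Total runs per batsman, highest first (ties broken by name).
query = """
    SELECT batsman, SUM(runs_scored) AS total_runs
    FROM ball_by_ball
    GROUP BY batsman
    ORDER BY total_runs DESC, batsman ASC
"""
top_scorers = conn.execute(query).fetchall()
conn.close()

for batsman, total_runs in top_scorers:
    print(batsman, total_runs)
```

With PySpark, a query of this shape would typically be run by registering a DataFrame with `createOrReplaceTempView` and passing the SQL string to `spark.sql`.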
