https://github.com/kmohamedalie/sparkml-projects
https://github.com/kmohamedalie/sparkml-projects
Last synced: 8 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/kmohamedalie/sparkml-projects
- Owner: Kmohamedalie
- Created: 2024-07-05T16:01:15.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-07-05T18:41:31.000Z (over 1 year ago)
- Last Synced: 2025-01-02T15:31:23.572Z (9 months ago)
- Language: Jupyter Notebook
- Size: 27.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# [Machine Learning with Apache Spark by IBM](https://www.coursera.org/learn/machine-learning-with-apache-spark/ungradedLti/6Zy8D/practice-project-create-a-machine-learning-pipeline-for-a-regression-project)
## **[Practice Case](https://www.coursera.org/learn/machine-learning-with-apache-spark/supplement/VrMkz/practice-project-overview)**
Welcome to this [practice project](https://github.com/Kmohamedalie/sparkML-projects/blob/master/Practice_Project.ipynb), where you will create an end-to-end solution using machine learning pipelines for regression. Your objective is to clean the dataset, create a model that predicts the SoundLevel based on the other columns, evaluate its performance, and persist the model for future use.
## **[Final Case](https://www.coursera.org/learn/machine-learning-with-apache-spark/supplement/DCKKu/final-project-overview)**
Welcome to the [final project](https://github.com/Kmohamedalie/sparkML-projects/blob/master/Final_Project.ipynb) of this course! In this project, you will work with the modified version of the NASA Airfoil Self Noise dataset. The goal is to clean the dataset, create a Machine Learning pipeline, evaluate the model's performance, and persist it for future use.