https://github.com/noahgift/databricks-zero-to-mlops



# databricks-zero-to-mlops

A four-week course on MLOps with Spark and Databricks. By the end of this course, you will be able to develop ML models with standardized workflows for data processing, model management, and model maintenance, all built on open-source tools. You will use PySpark and MLflow to build scalable, reproducible solutions that follow MLOps methodologies.

## Syllabus

* [syllabus](docs/syllabus.md)

## Demos and Deliverables

This course includes a weekly demo video assignment. Each week you will report on your ongoing project in a 30-90 second video that includes a screencast of your progress. The weekly demo is meant to increase metacognition and to teach other students what you know, improving the learning experience for everyone.

## Artifacts

## Datasets

* [Azure Open Datasets](https://docs.microsoft.com/en-us/azure/open-datasets/dataset-catalog#AzureDatabricks)
* [Example Open Dataset (TartanAir)](https://docs.microsoft.com/en-us/azure/open-datasets/dataset-tartanair-simulation)
* [Example Open Dataset (NYC Taxi)](https://github.com/MicrosoftDocs/azure-docs/blob/main/articles/machine-learning/tutorial-auto-train-models.md?context=azure%2fopen-datasets%2fcontext%2fopen-datasets-context)
* [Fake News Classification](https://www.kaggle.com/ruchi798/source-based-news-classification)
* [Social Power NBA](https://github.com/noahgift/socialpowernba/blob/master/data/nba_2017_players_with_salary_wiki_twitter.csv)

## Pre-work before you start

* [Create a free Azure account for Databricks](https://azure.microsoft.com/free/)
* [Follow the Azure Databricks quickstart](https://docs.microsoft.com/en-us/azure/databricks/scenarios/quickstart-create-databricks-workspace-portal?tabs=azure-portal)

## References

* [MLOps Platforms YouTube Video](https://www.youtube.com/watch?v=PUXhWZQW8BI)
* [MLOps Platforms O'Reilly Video](https://learning.oreilly.com/videos/mlops-platforms-from/032232022VIDEOPAIML/)
* [Azure Databricks](https://azure.microsoft.com/en-us/services/databricks/)
* [Cloud Computing for Data Duke Course](https://github.com/noahgift/cloud-data-analysis-at-scale)
* [Learning Spark 2nd Edition](https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf)