{"id":50596353,"url":"https://github.com/dmatrix/mlflow-workshop-part-1","last_synced_at":"2026-06-05T14:31:56.139Z","repository":{"id":50377471,"uuid":"255954568","full_name":"dmatrix/mlflow-workshop-part-1","owner":"dmatrix","description":"Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three part series, we will cover MLflow Tracking, Projects, Models, and Model Registry. ","archived":false,"fork":false,"pushed_at":"2020-08-26T20:42:01.000Z","size":13378,"stargazers_count":220,"open_issues_count":0,"forks_count":135,"subscribers_count":10,"default_branch":"master","last_synced_at":"2023-11-07T16:14:58.925Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dmatrix.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2020-04-15T15:11:49.000Z","updated_at":"2023-10-26T16:17:24.000Z","dependencies_parsed_at":"2022-08-22T13:01:43.686Z","dependency_job_id":null,"html_url":"https://github.com/dmatrix/mlflow-workshop-part-1","commit_stats":null,"previous_names":[],"tags_count":0,"template":null,"template_full_name":null,"purl":"pkg:github/dmatrix/mlflow-workshop-part-1","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmatrix%2Fmlflow-workshop-part-1","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmatrix%2Fmlflow-workshop-part-1/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmatrix%2Fmlflow-workshop-part-1/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/rep
ositories/dmatrix%2Fmlflow-workshop-part-1/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dmatrix","download_url":"https://codeload.github.com/dmatrix/mlflow-workshop-part-1/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmatrix%2Fmlflow-workshop-part-1/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33946818,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-05T02:00:06.157Z","response_time":120,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2026-06-05T14:31:56.057Z","updated_at":"2026-06-05T14:31:56.119Z","avatar_url":"https://github.com/dmatrix.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":" Managing the Complete Machine Learning Lifecycle with MLflow\n=============================================================\n![](images/mlflow-workshop.png)\n\nPart 1 of 3\n-----------\nOther parts:\n- [Part 2](https://github.com/dmatrix/mlflow-workshop-part-2)\n- [Part 3](https://github.com/dmatrix/mlflow-workshop-part-3)\n- [Watch workshops on YouTube](https://www.youtube.com/playlist?list=PLTPXxbhUt-YWjDg318nmSxRqTgZFWQ2ZC)\n\nContent for the MLflow Series\n-----------------------------\nMachine Learning (ML) development brings many new complexities beyond the traditional software 
development lifecycle. Unlike in traditional software development, ML developers want to try multiple algorithms, tools, and parameters to get the best results, and they need to track this information to reproduce their work. In addition, developers need to use many distinct systems to productionize models.\n\nTo solve these challenges, [MLflow](https://mlflow.org), an open source project, simplifies the entire ML lifecycle. MLflow introduces simple abstractions to package reproducible projects, track results, \nencapsulate models that can be used with many existing tools, and share models through a central repository,\naccelerating the ML lifecycle for organizations of any size.\n\nGoal and Objective\n------------------\nAimed at the beginner or intermediate level, this three-part series teaches data scientists and ML developers how to \nleverage MLflow as a platform to track experiments, package projects to reproduce runs, use model flavors to deploy in diverse environments, and manage models in a central repository for sharing.\n\nWhat you will learn\n-------------------\nUnderstand the four main components of open source MLflow (MLflow Tracking, MLflow Projects, MLflow Models, and Model Registry) and how each component helps address challenges of the ML lifecycle.\n * How to use [MLflow Tracking](https://mlflow.org/docs/latest/tracking.html) to record and query experiments: code, data, config, and results.\n * How to use the [MLflow Projects](https://mlflow.org/docs/latest/projects.html) packaging format to reproduce runs.\n * How to use the [MLflow Models](https://mlflow.org/docs/latest/models.html) general format to send models to diverse deployment tools.\n * How to use the [Model Registry](https://mlflow.org/docs/latest/model-registry.html) for collaborative model lifecycle management.\n * How to use the [MLflow UI](https://mlflow.org/docs/latest/tracking.html#tracking-ui) to visually compare and contrast experimental runs with different tuning parameters and evaluate 
metrics\n\n\nInstructor\n-----------\n\n- [Jules S. Damji](https://www.linkedin.com/in/dmatrix/) [@2twitme](https://twitter.com/2twitme) \n---\n\n\nAbout the MLflow workshop part 1\n--------------------------------\n\nIn part 1, we will cover:\n * Concepts and motivation behind MLflow\n * How to use Databricks Community Edition (DCE)\n * A tour of the MLflow API documentation\n * An introduction to the MLflow Python Fluent Tracking APIs\n * A walk and work through three machine learning models using MLflow APIs in the DCE\n * Using the MLflow UI as part of DCE to compare experiment metrics, parameters, and runs\n\nPrerequisites\n-------------\n* Before the session, please pre-register for [Databricks Community Edition](https://databricks.com/try-databricks)\n* Knowledge of Python 3 and programming in general\n* Preferably a UNIX-based, fully-charged laptop with 8-16 GB of RAM and a Chrome or Firefox browser\n* Familiarity with GitHub and git, and an account on GitHub\n* Some knowledge of Machine Learning concepts, libraries, and frameworks \n     * scikit-learn\n     * pandas and NumPy\n     * matplotlib\n* [**optional for part-1**] PyCharm/IntelliJ or a syntax-aware Python editor of your choice\n* [**optional for part-1**] pip/pip3 or conda and Python 3 installed\n* Loads of virtual laughter, curiosity, and a sense of humor ... :-)\n\nObtaining the Tutorial Material\n--------------------------------\n\nFamiliarity with **git** is important so that you can get all the material easily during the tutorial and\nworkshop, and continue to work on it in your free time after the session is over.\n\n```\ngit clone git@github.com:dmatrix/mlflow-workshop-part-1.git\n# or, over HTTPS:\ngit clone https://github.com/dmatrix/mlflow-workshop-part-1.git\n```\n\nDocumentation Resources\n-----------------------\n\nThis tutorial refers to the following documentation: \n\n1. [MLflow](https://mlflow.org/docs/latest/index.html) \n2. [NumPy](https://numpy.org/devdocs/user/quickstart.html)\n3. 
[Pandas](https://pandas.pydata.org/pandas-docs/stable/reference/index.html)\n4. [Scikit-Learn](https://scikit-learn.org/stable/index.html)\n5. [Keras](https://keras.io/optimizers/)\n6. [TensorFlow](https://tensorflow.org)\n7. [Matplotlib](https://matplotlib.org/3.2.0/tutorials/introductory/pyplot.html)\n\nHow to get started\n-------------------\nWe will walk through these steps during the session, but please sign up for [Databricks Community Edition](https://databricks.com/try-databricks) before the session:\n\n1. ``` git clone git@github.com:dmatrix/mlflow-workshop-part-1.git ```\n2. Use this [URL](https://community.cloud.databricks.com/login.html) to log into the Databricks Community Edition\n\n![](images/databricks_ce_loging.png)\n\n3. Create an ML Runtime 6.5 cluster\n\n![](images/databricks_ce_create_mlr.png)\n\n4. In the browser: \n  * (1) Go to the GitHub **notebooks** subdirectory\n  * (2) Download the **MLFlow-CE.dbc** file to your laptop\n\n![](images/databricks_ce_download_notebooks.png)\n\n5. Import the **MLFlow-CE.dbc** file into the Databricks Community Edition\n\n![](images/databricks_ce_import_notebooks.png)\n\nLet's go!\n\nCheers,\n\nJules\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdmatrix%2Fmlflow-workshop-part-1","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdmatrix%2Fmlflow-workshop-part-1","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdmatrix%2Fmlflow-workshop-part-1/lists"}