{"id":13405544,"url":"https://github.com/kelvins/awesome-mlops","last_synced_at":"2025-09-28T14:30:59.089Z","repository":{"id":37090334,"uuid":"266895706","full_name":"kelvins/awesome-mlops","owner":"kelvins","description":":sunglasses: A curated list of awesome MLOps tools","archived":false,"fork":false,"pushed_at":"2024-09-23T18:26:47.000Z","size":423,"stargazers_count":4068,"open_issues_count":4,"forks_count":565,"subscribers_count":89,"default_branch":"main","last_synced_at":"2024-10-30T23:43:16.646Z","etag":null,"topics":["ai","awesome","data-science","machine-learning","machine-learning-engineering","ml","mle","mlops"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kelvins.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":null,"code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-05-25T22:53:26.000Z","updated_at":"2024-10-30T14:54:05.000Z","dependencies_parsed_at":"2023-11-12T14:21:54.743Z","dependency_job_id":"1323a716-3b25-4292-87c5-30b9954040fc","html_url":"https://github.com/kelvins/awesome-mlops","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kelvins%2Fawesome-mlops","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kelvins%2Fawesome-mlops/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kelvins%2Fawesome-mlops/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kelvins%2Fawesome-mlops/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kelvins","download_url":"https://codeload.github.com/kelvins/awesome-mlops/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":234525630,"owners_count":18846937,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","awesome","data-science","machine-learning","machine-learning-engineering","ml","mle","mlops"],"created_at":"2024-07-30T19:02:05.069Z","updated_at":"2025-09-28T14:30:59.071Z","avatar_url":"https://github.com/kelvins.png","language":"Python","funding_links":[],"categories":["Python","Models and Tools","Uncategorized","Related awesome lists","人工智能（AI）","其他_机器学习与深度学习","To Sort","Profiling","Other Lists","System \u0026 Production","Awesome Lists","Machine Learning Engineering Operations (MLOps) ##","语言资源库","\u003ca id=\"meta\"\u003e\u003c/a\u003e🏁 Meta","Awesome MLOPs","General Directories","🚀 MLOps"],"sub_categories":["General MLOps Tools","Uncategorized","MLOps/ModelOps","Vector Database","Profiling","awesome-library","TeX Lists","Bleeding Edge ⚗️","Resources"],"readme":"# Awesome MLOps [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)\n\nA curated list of awesome MLOps tools.\n\nInspired by [awesome-python](https://github.com/vinta/awesome-python).\n\n- [Awesome MLOps](#awesome-mlops)\n    - [AutoML](#automl)\n    - [CI/CD for Machine Learning](#cicd-for-machine-learning)\n    - [Cron Job Monitoring](#cron-job-monitoring)\n    - [Data Catalog](#data-catalog)\n    - [Data Enrichment](#data-enrichment)\n    - [Data Exploration](#data-exploration)\n    - [Data Management](#data-management)\n    - [Data Processing](#data-processing)\n    - [Data Validation](#data-validation)\n    - [Data Visualization](#data-visualization)\n    - [Drift Detection](#drift-detection)\n    - [Feature Engineering](#feature-engineering)\n    - [Feature Store](#feature-store)\n    - [Hyperparameter Tuning](#hyperparameter-tuning)\n    - [Knowledge Sharing](#knowledge-sharing)\n    - [Machine Learning Platform](#machine-learning-platform)\n    - [Model Fairness and Privacy](#model-fairness-and-privacy)\n    - [Model Interpretability](#model-interpretability)\n    - [Model Lifecycle](#model-lifecycle)\n    - [Model Serving](#model-serving)\n    - [Model Testing \u0026 Validation](#model-testing--validation)\n    - [Optimization Tools](#optimization-tools)\n    - [Simplification Tools](#simplification-tools)\n    - [Visual Analysis and Debugging](#visual-analysis-and-debugging)\n    - [Workflow Tools](#workflow-tools)\n- [Resources](#resources)\n    - [Articles](#articles)\n    - [Books](#books)\n    - [Events](#events)\n    - [Other Lists](#other-lists)\n    - [Podcasts](#podcasts)\n    - [Slack](#slack)\n    - [Websites](#websites)\n- [Contributing](#contributing)\n\n---\n\n## AutoML\n\n*Tools for performing AutoML.*\n\n* [AutoGluon](https://github.com/awslabs/autogluon) - Automated machine learning for image, text, tabular, time-series, and multi-modal data.\n* [AutoKeras](https://github.com/keras-team/autokeras) - AutoKeras goal is to make machine learning accessible for everyone.\n* [AutoPyTorch](https://github.com/automl/Auto-PyTorch) - Automatic architecture search and hyperparameter optimization for PyTorch.\n* [AutoSKLearn](https://github.com/automl/auto-sklearn) - Automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator.\n* [EvalML](https://github.com/alteryx/evalml) - A library that builds, optimizes, and evaluates ML pipelines using domain-specific functions.\n* [FLAML](https://github.com/microsoft/FLAML) - Finds accurate ML models automatically, efficiently and economically.\n* [H2O AutoML](https://h2o.ai/platform/h2o-automl) - Automates ML workflow, which includes automatic training and tuning of models.\n* [MindsDB](https://github.com/mindsdb/mindsdb) - AI layer for databases that allows you to effortlessly develop, train and deploy ML models.\n* [MLBox](https://github.com/AxeldeRomblay/MLBox) - MLBox is a powerful Automated Machine Learning python library.\n* [Model Search](https://github.com/google/model_search) - Framework that implements AutoML algorithms for model architecture search at scale.\n* [NNI](https://github.com/microsoft/nni) - An open source AutoML toolkit for automate machine learning lifecycle.\n\n## CI/CD for Machine Learning\n\n*Tools for performing CI/CD for Machine Learning.*\n\n* [ClearML](https://github.com/allegroai/clearml) - Auto-Magical CI/CD to streamline your ML workflow.\n* [CML](https://github.com/iterative/cml) - Open-source library for implementing CI/CD in machine learning projects.\n* [KitOps](https://github.com/jozu-ai/kitops) – Open source MLOps project that eases model handoffs between data scientist and DevOps. \n\n## Cron Job Monitoring\n\n*Tools for monitoring cron jobs (recurring jobs).*\n\n* [Cronitor](https://cronitor.io/cron-job-monitoring) - Monitor any cron job or scheduled task.\n* [HealthchecksIO](https://healthchecks.io/) - Simple and effective cron job monitoring.\n\n## Data Catalog\n\n*Tools for data cataloging.*\n\n* [Amundsen](https://www.amundsen.io/) - Data discovery and metadata engine for improving the productivity when interacting with data.\n* [Apache Atlas](https://atlas.apache.org) - Provides open metadata management and governance capabilities to build a data catalog.\n* [CKAN](https://github.com/ckan/ckan) - Open-source DMS (data management system) for powering data hubs and data portals.\n* [DataHub](https://github.com/linkedin/datahub) - LinkedIn's generalized metadata search \u0026 discovery tool.\n* [Magda](https://github.com/magda-io/magda) - A federated, open-source data catalog for all your big data and small data.\n* [Metacat](https://github.com/Netflix/metacat) - Unified metadata exploration API service for Hive, RDS, Teradata, Redshift, S3 and Cassandra.\n* [OpenMetadata](https://open-metadata.org/) - A Single place to discover, collaborate and get your data right.\n\n## Data Enrichment\n\n*Tools and libraries for data enrichment.*\n\n* [Snorkel](https://github.com/snorkel-team/snorkel) - A system for quickly generating training data with weak supervision.\n* [Upgini](https://github.com/upgini/upgini) - Enriches training datasets with features from public and community shared data sources.\n\n## Data Exploration\n\n*Tools for performing data exploration.*\n\n* [Apache Zeppelin](https://zeppelin.apache.org/) - Enables data-driven, interactive data analytics and collaborative documents.\n* [BambooLib](https://github.com/tkrabel/bamboolib) -  An intuitive GUI for Pandas DataFrames.\n* [DataPrep](https://github.com/sfu-db/dataprep) - Collect, clean and visualize your data in Python.\n* [Google Colab](https://colab.research.google.com) - Hosted Jupyter notebook service that requires no setup to use.\n* [Jupyter Notebook](https://jupyter.org/) - Web-based notebook environment for interactive computing.\n* [JupyterLab](https://jupyterlab.readthedocs.io) - The next-generation user interface for Project Jupyter.\n* [Jupytext](https://github.com/mwouts/jupytext) - Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts.\n* [Pandas Profiling](https://github.com/ydataai/pandas-profiling) - Create HTML profiling reports from pandas DataFrame objects.\n* [Polynote](https://polynote.org/) - The polyglot notebook with first-class Scala support.\n\n## Data Management\n\n*Tools for performing data management.*\n\n* [Arrikto](https://www.arrikto.com/) - Dead simple, ultra fast storage for the hybrid Kubernetes world.\n* [BlazingSQL](https://github.com/BlazingDB/blazingsql) - A lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.\n* [Delta Lake](https://github.com/delta-io/delta) - Storage layer that brings scalable, ACID transactions to Apache Spark and other engines.\n* [Dolt](https://github.com/dolthub/dolt) - SQL database that you can fork, clone, branch, merge, push and pull just like a git repository.\n* [Dud](https://github.com/kevin-hanselman/dud) - A lightweight CLI tool for versioning data alongside source code and building data pipelines.\n* [DVC](https://dvc.org/) - Management and versioning of datasets and machine learning models.\n* [Git LFS](https://git-lfs.github.com) - An open source Git extension for versioning large files.\n* [Hub](https://github.com/activeloopai/Hub) - A dataset format for creating, storing, and collaborating on AI datasets of any size.\n* [Intake](https://github.com/intake/intake) - A lightweight set of tools for loading and sharing data in data science projects.\n* [lakeFS](https://github.com/treeverse/lakeFS) - Repeatable, atomic and versioned data lake on top of object storage.\n* [Marquez](https://github.com/MarquezProject/marquez) - Collect, aggregate, and visualize a data ecosystem's metadata.\n* [Milvus](https://github.com/milvus-io/milvus/) - An open source embedding vector similarity search engine powered by Faiss, NMSLIB and Annoy.\n* [Pinecone](https://www.pinecone.io) - Managed and distributed vector similarity search used with a lightweight SDK.\n* [Qdrant](https://github.com/qdrant/qdrant) - An open source vector similarity search engine with extended filtering support.\n* [Quilt](https://github.com/quiltdata/quilt) - A self-organizing data hub with S3 support.\n\n## Data Processing\n\n*Tools related to data processing and data pipelines.*\n\n* [Airflow](https://airflow.apache.org/) - Platform to programmatically author, schedule, and monitor workflows.\n* [Azkaban](https://github.com/azkaban/azkaban) - Batch workflow job scheduler created at LinkedIn to run Hadoop jobs.\n* [Dagster](https://github.com/dagster-io/dagster) - A data orchestrator for machine learning, analytics, and ETL.\n* [Hadoop](https://hadoop.apache.org/) - Framework that allows for the distributed processing of large data sets across clusters.\n* [OpenRefine](https://github.com/OpenRefine/OpenRefine) - Power tool for working with messy data and improving it.\n* [Spark](https://spark.apache.org/) - Unified analytics engine for large-scale data processing.\n\n## Data Validation\n\n*Tools related to data validation.*\n\n* [Cerberus](https://github.com/pyeve/cerberus) - Lightweight, extensible data validation library for Python.\n* [Cleanlab](https://github.com/cleanlab/cleanlab) - Python library for data-centric AI and machine learning with messy, real-world data and labels.\n* [Great Expectations](https://greatexpectations.io) - A Python data validation framework that allows to test your data against datasets.\n* [JSON Schema](https://json-schema.org/) - A vocabulary that allows you to annotate and validate JSON documents.\n* [TFDV](https://github.com/tensorflow/data-validation) - An library for exploring and validating machine learning data.\n\n## Data Visualization\n\n*Tools for data visualization, reports and dashboards.*\n\n* [Count](https://count.co) - SQL/drag-and-drop querying and visualisation tool based on notebooks.\n* [Dash](https://github.com/plotly/dash) - Analytical Web Apps for Python, R, Julia, and Jupyter.\n* [Data Studio](https://datastudio.google.com) - Reporting solution for power users who want to go beyond the data and dashboards of GA.\n* [Facets](https://github.com/PAIR-code/facets) - Visualizations for understanding and analyzing machine learning datasets.\n* [Grafana](https://grafana.com/grafana/) -  Multi-platform open source analytics and interactive visualization web application.\n* [Lux](https://github.com/lux-org/lux) - Fast and easy data exploration by automating the visualization and data analysis process.\n* [Metabase](https://www.metabase.com/) - The simplest, fastest way to get business intelligence and analytics to everyone.\n* [Redash](https://redash.io/) - Connect to any data source, easily visualize, dashboard and share your data.\n* [SolidUI](https://github.com/CloudOrc/SolidUI) - AI-generated visualization prototyping and editing platform, support 2D and 3D models.\n* [Superset](https://superset.incubator.apache.org/) - Modern, enterprise-ready business intelligence web application.\n* [Tableau](https://www.tableau.com) - Powerful and fastest growing data visualization tool used in the business intelligence industry.\n\n## Drift Detection\n\n*Tools and libraries related to drift detection.*\n\n* [Alibi Detect](https://github.com/SeldonIO/alibi-detect) - An open source Python library focused on outlier, adversarial and drift detection.\n* [Frouros](https://github.com/IFCA/frouros) - An open source Python library for drift detection in machine learning systems.\n* [TorchDrift](https://github.com/torchdrift/torchdrift/) - A data and concept drift library for PyTorch.\n\n## Feature Engineering\n\n*Tools and libraries related to feature engineering.*\n\n* [Feature Engine](https://github.com/feature-engine/feature_engine) - Feature engineering package with SKlearn like functionality.\n* [Featuretools](https://github.com/alteryx/featuretools) - Python library for automated feature engineering.\n* [TSFresh](https://github.com/blue-yonder/tsfresh) - Python library for automatic extraction of relevant features from time series.\n\n## Feature Store\n\n*Feature store tools for data serving.*\n\n* [Butterfree](https://github.com/quintoandar/butterfree) - A tool for building feature stores. Transform your raw data into beautiful features.\n* [ByteHub](https://github.com/bytehub-ai/bytehub) - An easy-to-use feature store. Optimized for time-series data.\n* [Feast](https://feast.dev/) - End-to-end open source feature store for machine learning.\n* [Feathr](https://github.com/linkedin/feathr) - An enterprise-grade, high performance feature store.\n* [Featureform](https://github.com/featureform/featureform) - A Virtual Feature Store. Turn your existing data infrastructure into a feature store.\n* [Tecton](https://www.tecton.ai/) - A fully-managed feature platform built to orchestrate the complete lifecycle of features.\n\n## Hyperparameter Tuning\n\n*Tools and libraries to perform hyperparameter tuning.*\n\n* [Advisor](https://github.com/tobegit3hub/advisor) - Open-source implementation of Google Vizier for hyper parameters tuning.\n* [Hyperas](https://github.com/maxpumperla/hyperas) - A very simple wrapper for convenient hyperparameter optimization.\n* [Hyperopt](https://github.com/hyperopt/hyperopt) - Distributed Asynchronous Hyperparameter Optimization in Python.\n* [Katib](https://github.com/kubeflow/katib) - Kubernetes-based system for hyperparameter tuning and neural architecture search.\n* [KerasTuner](https://github.com/keras-team/keras-tuner) - Easy-to-use, scalable hyperparameter optimization framework.\n* [Optuna](https://optuna.org/) - Open source hyperparameter optimization framework to automate hyperparameter search.\n* [Scikit Optimize](https://github.com/scikit-optimize/scikit-optimize) - Simple and efficient library to minimize expensive and noisy black-box functions.\n* [Talos](https://github.com/autonomio/talos) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch.\n* [Tune](https://docs.ray.io/en/latest/tune.html) - Python library for experiment execution and hyperparameter tuning at any scale.\n\n## Knowledge Sharing\n\n*Tools for sharing knowledge to the entire team/company.*\n\n* [Knowledge Repo](https://github.com/airbnb/knowledge-repo) - Knowledge sharing platform for data scientists and other technical professions.\n* [Kyso](https://kyso.io/) - One place for data insights so your entire team can learn from your data.\n\n## Machine Learning Platform\n\n*Complete machine learning platform solutions.*\n\n* [aiWARE](https://www.veritone.com/aiware/aiware-os/) - aiWARE helps MLOps teams evaluate, deploy, integrate, scale \u0026 monitor ML models.\n* [Algorithmia](https://algorithmia.com/) - Securely govern your machine learning operations with a healthy ML lifecycle.\n* [Allegro AI](https://allegro.ai/) - Transform ML/DL research into products. Faster.\n* [Bodywork](https://bodywork.readthedocs.io/en/latest/) - Deploys machine learning projects developed in Python, to Kubernetes.\n* [CNVRG](https://cnvrg.io/) - An end-to-end machine learning platform to build and deploy AI models at scale.\n* [DAGsHub](https://dagshub.com/) - A platform built on open source tools for data, model and pipeline management.\n* [Dataiku](https://www.dataiku.com/) - Platform democratizing access to data and enabling enterprises to build their own path to AI.\n* [DataRobot](https://www.datarobot.com/) - AI platform that democratizes data science and automates the end-to-end ML at scale.\n* [Domino](https://www.dominodatalab.com/) - One place for your data science tools, apps, results, models, and knowledge.\n* [Edge Impulse](https://edgeimpulse.com/) - Platform for creating, optimizing, and deploying AI/ML algorithms for edge devices.\n* [envd](https://github.com/tensorchord/envd) - Machine learning development environment for data science and AI/ML engineering teams.\n* [FedML](https://fedml.ai/) - Simplifies the workflow of federated learning anywhere at any scale.\n* [Gradient](https://gradient.paperspace.com/) - Multicloud CI/CD and MLOps platform for machine learning teams.\n* [H2O](https://www.h2o.ai/) - Open source leader in AI with a mission to democratize AI for everyone.\n* [Hopsworks](https://www.hopsworks.ai/) - Open-source platform for developing and operating machine learning models at scale.\n* [Iguazio](https://www.iguazio.com/) - Data science platform that automates MLOps with end-to-end machine learning pipelines.\n* [Katonic](https://katonic.ai/) - Automate your cycle of intelligence with Katonic MLOps Platform.\n* [Knime](https://www.knime.com/) - Create and productionize data science using one easy and intuitive environment.\n* [Kubeflow](https://www.kubeflow.org/) - Making deployments of ML workflows on Kubernetes simple, portable and scalable.\n* [LynxKite](https://lynxkite.com/) - A complete graph data science platform for very large graphs and other datasets.\n* [ML Workspace](https://github.com/ml-tooling/ml-workspace) - All-in-one web-based IDE specialized for machine learning and data science.\n* [MLReef](https://github.com/MLReef/mlreef) - Open source MLOps platform that helps you collaborate, reproduce and share your ML work.\n* [Modzy](https://www.modzy.com/) - Deploy, connect, run, and monitor machine learning (ML) models in the enterprise and at the edge.\n* [Neu.ro](https://neu.ro) - MLOps platform that integrates open-source and proprietary tools into client-oriented systems.\n* [Omnimizer](https://www.omniml.ai) - Simplifies and accelerates MLOps by bridging the gap between ML models and edge hardware.\n* [Pachyderm](https://www.pachyderm.com/) - Combines data lineage with end-to-end pipelines on Kubernetes, engineered for the enterprise.\n* [Polyaxon](https://www.github.com/polyaxon/polyaxon/) - A platform for reproducible and scalable machine learning and deep learning on kubernetes.\n* [Sagemaker](https://aws.amazon.com/sagemaker/) - Fully managed service that provides the ability to build, train, and deploy ML models quickly.\n* [SAS Viya](https://www.sas.com/en_us/software/viya.html) - Cloud native AI, analytic and data management platform that supports the analytics life cycle.\n* [Sematic](https://sematic.dev) - An open-source end-to-end pipelining tool to go from laptop prototype to cloud in no time.\n* [SigOpt](https://sigopt.com/) - A platform that makes it easy to track runs, visualize training, and scale hyperparameter tuning.\n* [TrueFoundry](https://www.truefoundry.com) - A Cloud-native MLOps Platform over Kubernetes to simplify training and serving of ML Models.\n* [Valohai](https://valohai.com/) - Takes you from POC to production while managing the whole model lifecycle.\n\n## Model Fairness and Privacy\n\n*Tools for performing model fairness and privacy in production.*\n\n* [AIF360](https://github.com/Trusted-AI/AIF360) - A comprehensive set of fairness metrics for datasets and machine learning models.\n* [Fairlearn](https://github.com/fairlearn/fairlearn) - A Python package to assess and improve fairness of machine learning models.\n* [Opacus](https://github.com/pytorch/opacus) - A library that enables training PyTorch models with differential privacy.\n* [TensorFlow Privacy](https://github.com/tensorflow/privacy) - Library for training machine learning models with privacy for training data.\n\n## Model Interpretability\n\n*Tools for performing model interpretability/explainability.*\n\n* [Alibi](https://github.com/SeldonIO/alibi) - Open-source Python library enabling ML model inspection and interpretation.\n* [Captum](https://github.com/pytorch/captum) - Model interpretability and understanding library for PyTorch.\n* [ELI5](https://github.com/eli5-org/eli5) - Python package which helps to debug machine learning classifiers and explain their predictions.\n* [InterpretML](https://github.com/interpretml/interpret) - A toolkit to help understand models and enable responsible machine learning.\n* [LIME](https://github.com/marcotcr/lime) - Explaining the predictions of any machine learning classifier.\n* [Lucid](https://github.com/tensorflow/lucid) - Collection of infrastructure and tools for research in neural network interpretability.\n* [SAGE](https://github.com/iancovert/sage) - For calculating global feature importance using Shapley values.\n* [SHAP](https://github.com/slundberg/shap) - A game theoretic approach to explain the output of any machine learning model.\n\n## Model Lifecycle\n\n*Tools for managing model lifecycle (tracking experiments, parameters and metrics).*\n\n* [Aeromancy](https://github.com/quant-aq/aeromancy) - A framework for performing reproducible AI and ML for Weights and Biases.\n* [Aim](https://github.com/aimhubio/aim) - A super-easy way to record, search and compare 1000s of ML training runs.\n* [Cascade](https://github.com/Oxid15/cascade) - Library of ML-Engineering tools for rapid prototyping and experiment management.\n* [Comet](https://github.com/comet-ml/comet-examples) - Track your datasets, code changes, experimentation history, and models.\n* [Guild AI](https://guild.ai/) - Open source experiment tracking, pipeline automation, and hyperparameter tuning.\n* [Keepsake](https://github.com/replicate/keepsake) - Version control for machine learning with support to Amazon S3 and Google Cloud Storage.\n* [Losswise](https://losswise.com) - Makes it easy to track the progress of a machine learning project.\n* [MLflow](https://mlflow.org/) - Open source platform for the machine learning lifecycle.\n* [ModelDB](https://github.com/VertaAI/modeldb/) - Open source ML model versioning, metadata, and experiment management.\n* [Neptune AI](https://neptune.ai/) - The most lightweight experiment management tool that fits any workflow.\n* [Sacred](https://github.com/IDSIA/sacred) - A tool to help you configure, organize, log and reproduce experiments.\n* [Weights and Biases](https://github.com/wandb/client) - A tool for visualizing and tracking your machine learning experiments.\n\n## Model Serving\n\n*Tools for serving models in production.*\n\n* [Banana](https://banana.dev) - Host your ML inference code on serverless GPUs and integrate it into your app with one line of code.\n* [Beam](https://beam.cloud) - Develop on serverless GPUs, deploy highly performant APIs, and rapidly prototype ML models.\n* [BentoML](https://github.com/bentoml/BentoML) - Open-source platform for high-performance ML model serving.\n* [BudgetML](https://github.com/ebhy/budgetml) - Deploy a ML inference service on a budget in less than 10 lines of code.\n* [Cog](https://github.com/replicate/cog) - Open-source tool that lets you package ML models in a standard, production-ready container.\n* [Cortex](https://www.cortex.dev/) - Machine learning model serving infrastructure.\n* [Geniusrise](https://docs.geniusrise.ai) - Host inference APIs, bulk inference and fine tune text, vision, audio and multi-modal models.\n* [Gradio](https://github.com/gradio-app/gradio) - Create customizable UI components around your models.\n* [GraphPipe](https://oracle.github.io/graphpipe) - Machine learning model deployment made simple.\n* [Hydrosphere](https://github.com/Hydrospheredata/hydro-serving) - Platform for deploying your Machine Learning to production.\n* [KFServing](https://github.com/kubeflow/kfserving) - Kubernetes custom resource definition for serving ML models on arbitrary frameworks.\n* [LocalAI](https://github.com/mudler/LocalAI) - Drop-in replacement REST API that’s compatible with OpenAI API specifications for inferencing.\n* [Merlin](https://github.com/gojek/merlin) - A platform for deploying and serving machine learning models.\n* [MLEM](https://github.com/iterative/mlem) - Version and deploy your ML models following GitOps principles.\n* [Opyrator](https://github.com/ml-tooling/opyrator) - Turns your ML code into microservices with web API, interactive GUI, and more.\n* [PredictionIO](https://github.com/apache/predictionio) - Event collection, deployment of algorithms, evaluation, querying predictive results via APIs.\n* [Quix](https://quix.io) - Serverless platform for processing data streams in real-time with machine learning models.\n* [Rune](https://github.com/hotg-ai/rune) - Provides containers to encapsulate and deploy EdgeML pipelines and applications.\n* [Seldon](https://www.seldon.io/) - Take your ML projects from POC to production with maximum efficiency and minimal risk.\n* [Streamlit](https://github.com/streamlit/streamlit) - Lets you create apps for your ML projects with deceptively simple Python scripts.\n* [TensorFlow Serving](https://www.tensorflow.org/tfx/guide/serving) - Flexible, high-performance serving system for ML models, designed for production.\n* [TorchServe](https://github.com/pytorch/serve) - A flexible and easy to use tool for serving PyTorch models.\n* [Triton Inference Server](https://github.com/triton-inference-server/server) - Provides an optimized cloud and edge inferencing solution.\n* [Vespa](https://github.com/vespa-engine/vespa) - Store, search, organize and make machine-learned inferences over big data at serving time.\n* [Wallaroo.AI](https://wallaroo.ai/) - A platform for deploying, serving, and optimizing ML models in both cloud and edge environments.\n\n## Model Testing \u0026 Validation\n\n*Tools for testing and validating models.*\n\n* [Deepchecks](https://github.com/deepchecks/deepchecks) - Open-source package for validating ML models \u0026 data, with various checks and suites.\n* [Starwhale](https://github.com/star-whale/starwhale) - An MLOps/LLMOps platform for model building, evaluation, and fine-tuning.\n* [Trubrics](https://github.com/trubrics/trubrics-sdk) - Validate machine learning with data science and domain expert feedback.\n\n## Optimization Tools\n\n*Optimization tools related to model scalability in production.*\n\n* [Accelerate](https://github.com/huggingface/accelerate) - A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision.\n* [Dask](https://dask.org/) - Provides advanced parallelism for analytics, enabling performance at scale for the tools you love.\n* [DeepSpeed](https://github.com/microsoft/DeepSpeed) - Deep learning optimization library that makes distributed training easy, efficient, and effective.\n* [Fiber](https://uber.github.io/fiber/) - Python distributed computing library for modern computer clusters.\n* [Horovod](https://github.com/horovod/horovod) - Distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.\n* [Mahout](https://mahout.apache.org/) - Distributed linear algebra framework and mathematically expressive Scala DSL.\n* [MLlib](https://spark.apache.org/mllib/) - Apache Spark's scalable machine learning library.\n* [Modin](https://github.com/modin-project/modin) - Speed up your Pandas workflows by changing a single line of code.\n* [Nebullvm](https://github.com/nebuly-ai/nebullvm) - Easy-to-use library to boost AI inference.\n* [Nos](https://github.com/nebuly-ai/nos) - Open-source module for running AI workloads on Kubernetes in an optimized way.\n* [Petastorm](https://github.com/uber/petastorm) - Enables single machine or distributed training and evaluation of deep learning models.\n* [Rapids](https://rapids.ai/index.html) - Gives the ability to execute end-to-end data science and analytics pipelines entirely on GPUs.\n* [Ray](https://github.com/ray-project/ray) - Fast and simple framework for building and running distributed applications.\n* [Singa](http://singa.apache.org/en/index.html) - Apache top level project, focusing on distributed training of DL and ML models.\n* [Tpot](https://github.com/EpistasisLab/tpot) - Automated ML tool that optimizes machine learning pipelines using genetic programming.\n\n## Simplification Tools\n\n*Tools related to machine learning simplification and standardization.*\n\n* [Chassis](https://chassisml.io) - Turns models into ML-friendly containers that run just about anywhere.\n* [Hermione](https://github.com/a3data/hermione) - Help Data Scientists on setting up more organized codes, in a quicker and simpler way.\n* [Hydra](https://github.com/facebookresearch/hydra) - A framework for elegantly configuring complex applications.\n* [Koalas](https://github.com/databricks/koalas) - Pandas API on Apache Spark. Makes data scientists more productive when interacting with big data.\n* [Ludwig](https://github.com/uber/ludwig) - Allows users to train and test deep learning models without the need to write code.\n* [MLNotify](https://github.com/aporia-ai/mlnotify) - No need to keep checking your training, just one import line and you'll know the second it's done.\n* [PyCaret](https://pycaret.org/) - Open source, low-code machine learning library in Python.\n* [Sagify](https://github.com/Kenza-AI/sagify) - A CLI utility to train and deploy ML/DL models on AWS SageMaker.\n* [Soopervisor](https://github.com/ploomber/soopervisor) - Export ML projects to Kubernetes (Argo workflows), Airflow, AWS Batch, and SLURM.\n* [Soorgeon](https://github.com/ploomber/soorgeon) - Convert monolithic Jupyter notebooks into maintainable pipelines.\n* [TrainGenerator](https://github.com/jrieke/traingenerator) - A web app to generate template code for machine learning.\n* [Turi Create](https://github.com/apple/turicreate) - Simplifies the development of custom machine learning models.\n\n## Visual Analysis and Debugging\n\n*Tools for performing visual analysis and debugging of ML/DL models.*\n\n* [Aporia](https://www.aporia.com/) - Observability with customized monitoring and explainability for ML models.\n* [Arize](https://www.arize.com/) - A free end-to-end ML observability and model monitoring platform.\n* [Evidently](https://github.com/evidentlyai/evidently) - Interactive reports to analyze ML models during validation or production monitoring.\n* [Fiddler](https://www.fiddler.ai/) - Monitor, explain, and analyze your AI in production.\n* [Manifold](https://github.com/uber/manifold) - A model-agnostic visual debugging tool for machine learning.\n* [NannyML](https://github.com/NannyML/nannyml) - Algorithm capable of fully capturing the impact of data drift on performance.\n* [Netron](https://github.com/lutzroeder/netron) - Visualizer for neural network, deep learning, and machine learning models.\n* [Opik](https://github.com/comet-ml/opik) - Evaluate, test, and ship LLM applications with a suite of observability tools.\n* [Phoenix](https://phoenix.arize.com) - MLOps in a Notebook for troubleshooting and fine-tuning generative LLM, CV, and tabular models.\n* [Radicalbit](https://github.com/radicalbit/radicalbit-ai-monitoring/) - The open source solution for monitoring your AI models in production.\n* [Superwise](https://www.superwise.ai) - Fully automated, enterprise-grade model observability in a self-service SaaS platform.\n* [Whylogs](https://github.com/whylabs/whylogs) - The open source standard for data logging. Enables ML monitoring and observability.\n* [Yellowbrick](https://github.com/DistrictDataLabs/yellowbrick) - Visual analysis and diagnostic tools to facilitate machine learning model selection.\n\n## Workflow Tools\n\n*Tools and frameworks to create workflows or pipelines in the machine learning context.*\n\n* [Argo](https://github.com/argoproj/argo) - Open source container-native workflow engine for orchestrating parallel jobs on Kubernetes.\n* [Automate Studio](https://www.veritone.com/applications/automate-studio/) - Rapidly build \u0026 deploy AI-powered workflows.\n* [Couler](https://github.com/couler-proj/couler) - Unified interface for constructing and managing workflows on different workflow engines.\n* [dstack](https://github.com/dstackai/dstack) - An open-core tool to automate data and training workflows.\n* [Flyte](https://flyte.org/) - Easy to create concurrent, scalable, and maintainable workflows for machine learning.\n* [Hamilton](https://github.com/dagworks-inc/hamilton) - A scalable general purpose micro-framework for defining dataflows.\n* [Kale](https://github.com/kubeflow-kale/kale) - Aims at simplifying the Data Science experience of deploying Kubeflow Pipelines workflows.\n* [Kedro](https://github.com/quantumblacklabs/kedro) - Library that implements software engineering best-practice for data and ML pipelines.\n* [Luigi](https://github.com/spotify/luigi) - Python module that helps you build complex pipelines of batch jobs.\n* [Metaflow](https://metaflow.org/) - Human-friendly lib that helps scientists and engineers build and manage data science projects.\n* [MLRun](https://github.com/mlrun/mlrun) - Generic mechanism for data scientists to build, run, and monitor ML tasks and pipelines.\n* [Orchest](https://github.com/orchest/orchest/) - Visual pipeline editor and workflow orchestrator with an easy to use UI and based on Kubernetes.\n* [Ploomber](https://github.com/ploomber/ploomber) - Write maintainable, production-ready pipelines. Develop locally, deploy to the cloud.\n* [Prefect](https://docs.prefect.io/) - A workflow management system, designed for modern infrastructure.\n* [VDP](https://github.com/instill-ai/vdp) - An open-source tool to seamlessly integrate AI for unstructured data into the modern data stack.\n* [Wordware](https://www.wordware.ai) - A web-hosted IDE where non-technical domain experts can build task-specific AI agents.\n* [ZenML](https://github.com/maiot-io/zenml) - An extensible open-source MLOps framework to create reproducible pipelines.\n\n---\n\n# Resources\n\nWhere to discover new tools and discuss about existing ones.\n\n## Articles\n\n* [Continuous Delivery for Machine Learning](https://martinfowler.com/articles/cd4ml.html) (Martin Fowler)\n* [Machine Learning Operations (MLOps): Overview, Definition, and Architecture](https://arxiv.org/abs/2205.02302) (arXiv)\n* [MLOps Roadmap: A Complete MLOps Career Guide](https://www.scaler.com/blog/mlops-roadmap/) (Scaler Blogs)\n* [MLOps: Continuous delivery and automation pipelines in machine learning](https://cloud.google.com/solutions/machine-learning/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning) (Google)\n* [MLOps: Machine Learning as an Engineering Discipline](https://towardsdatascience.com/ml-ops-machine-learning-as-an-engineering-discipline-b86ca4874a3f) (Medium)\n* [Rules of Machine Learning: Best Practices for ML Engineering](https://developers.google.com/machine-learning/guides/rules-of-ml) (Google)\n* [The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction](https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/aad9f93b86b7addfea4c419b9100c6cdd26cacea.pdf) (Google)\n* [What Is MLOps?](https://blogs.nvidia.com/blog/2020/09/03/what-is-mlops/) (NVIDIA)\n\n## Books\n\n* [Beginning MLOps with MLFlow](https://www.amazon.com/Beginning-MLOps-MLFlow-SageMaker-Microsoft/dp/1484265483) (Apress)\n* [Building Machine Learning Pipelines](https://www.oreilly.com/library/view/building-machine-learning/9781492053187) (O'Reilly)\n* [Building Machine Learning Powered Applications](https://www.oreilly.com/library/view/building-machine-learning/9781492045106) (O'Reilly)\n* [Deep Learning in Production](https://www.amazon.com/gp/product/6180033773) (AI Summer)\n* [Designing Machine Learning Systems](https://www.oreilly.com/library/view/designing-machine-learning/9781098107956) (O'Reilly)\n* [Engineering MLOps](https://www.packtpub.com/product/engineering-mlops/9781800562882) (Packt)\n* [Implementing MLOps in the Enterprise](https://www.oreilly.com/library/view/implementing-mlops-in/9781098136574) (O'Reilly)\n* [Introducing MLOps](https://www.oreilly.com/library/view/introducing-mlops/9781492083283) (O'Reilly)\n* [Kubeflow for Machine Learning](https://www.oreilly.com/library/view/kubeflow-for-machine/9781492050117) (O'Reilly)\n* [Kubeflow Operations Guide](https://www.oreilly.com/library/view/kubeflow-operations-guide/9781492053262) (O'Reilly)\n* [Machine Learning Design Patterns](https://www.oreilly.com/library/view/machine-learning-design/9781098115777) (O'Reilly)\n* [Machine Learning Engineering in Action](https://www.manning.com/books/machine-learning-engineering-in-action) (Manning)\n* [ML Ops: Operationalizing Data Science](https://www.oreilly.com/library/view/ml-ops-operationalizing/9781492074663) (O'Reilly)\n* [MLOps Engineering at Scale](https://www.manning.com/books/mlops-engineering-at-scale) (Manning)\n* [MLOps Lifecycle Toolkit](https://link.springer.com/book/10.1007/978-1-4842-9642-4) (Apress)\n* [Practical Deep Learning at Scale with MLflow](https://www.packtpub.com/product/practical-deep-learning-at-scale-with-mlflow/9781803241333) (Packt)\n* [Practical MLOps](https://www.oreilly.com/library/view/practical-mlops/9781098103002) (O'Reilly)\n* [Production-Ready Applied Deep Learning](https://www.packtpub.com/product/production-ready-applied-deep-learning/9781803243665) (Packt)\n* [Reliable Machine Learning](https://www.oreilly.com/library/view/reliable-machine-learning/9781098106218) (O'Reilly)\n* [The Machine Learning Solutions Architect Handbook](https://www.packtpub.com/product/the-machine-learning-solutions-architect-handbook/9781801072168) (Packt)\n\n## Events\n\n* [apply() - The ML data engineering conference](https://www.applyconf.com/)\n* [MLOps Conference - Keynotes and Panels](https://www.youtube.com/playlist?list=PLH8M0UOY0uy6d_n3vEQe6J_gRBUrISF9m)\n* [MLOps World: Machine Learning in Production Conference](https://mlopsworld.com/)\n* [NormConf - The Normcore Tech Conference](https://normconf.com/)\n* [Stanford MLSys Seminar Series](https://mlsys.stanford.edu/)\n\n## Other Lists\n\n* [Applied ML](https://github.com/eugeneyan/applied-ml)\n* [Awesome AutoML Papers](https://github.com/hibayesian/awesome-automl-papers)\n* [Awesome AutoML](https://github.com/windmaple/awesome-AutoML)\n* [Awesome Data Science](https://github.com/academic/awesome-datascience)\n* [Awesome DataOps](https://github.com/kelvins/awesome-dataops)\n* [Awesome Deep Learning](https://github.com/ChristosChristofidis/awesome-deep-learning)\n* [Awesome Game Datasets](https://github.com/leomaurodesenv/game-datasets) (includes AI content)\n* [Awesome Machine Learning](https://github.com/josephmisiti/awesome-machine-learning)\n* [Awesome MLOps](https://github.com/visenger/awesome-mlops)\n* [Awesome Production Machine Learning](https://github.com/EthicalML/awesome-production-machine-learning)\n* [Awesome Python](https://github.com/vinta/awesome-python)\n* [Deep Learning in Production](https://github.com/ahkarami/Deep-Learning-in-Production)\n\n## Podcasts\n\n* [AI Stories Podcast](https://www.youtube.com/@aistoriespodcast)\n* [Kubernetes Podcast from Google](https://kubernetespodcast.com/)\n* [Machine Learning – Software Engineering Daily](https://podcasts.google.com/?feed=aHR0cHM6Ly9zb2Z0d2FyZWVuZ2luZWVyaW5nZGFpbHkuY29tL2NhdGVnb3J5L21hY2hpbmUtbGVhcm5pbmcvZmVlZC8)\n* [MLOps.community](https://podcasts.google.com/?feed=aHR0cHM6Ly9hbmNob3IuZm0vcy8xNzRjYjFiOC9wb2RjYXN0L3Jzcw)\n* [Pipeline Conversation](https://podcast.zenml.io/)\n* [Practical AI: Machine Learning, Data Science](https://changelog.com/practicalai)\n* [This Week in Machine Learning \u0026 AI](https://twimlai.com/)\n* [True ML Talks](https://www.youtube.com/playlist?list=PL4-eEhdXDO5F9Myvh41EeUh7oCgzqFRGk)\n\n## Slack\n\n* [Kubeflow Workspace](https://kubeflow.slack.com/#/)\n* [MLOps Community Wokspace](https://mlops-community.slack.com)\n\n## Websites\n\n* [A guide to MLOps](https://mlops.swiss-ai-center.ch/)\n* [Feature Stores for ML](http://featurestore.org/)\n* [Made with ML](https://github.com/GokuMohandas/Made-With-ML)\n* [ML-Ops](https://ml-ops.org/)\n* [MLOps Community](https://mlops.community/)\n* [MLOps Guide](https://mlops-guide.github.io/)\n* [MLOps Now](https://mlopsnow.com)\n\n# Contributing\n\nAll contributions are welcome! Please take a look at the [contribution guidelines](https://github.com/kelvins/awesome-mlops/blob/main/CONTRIBUTING.md) first.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkelvins%2Fawesome-mlops","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkelvins%2Fawesome-mlops","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkelvins%2Fawesome-mlops/lists"}