Project demonstrating dual model deployment scenarios using Vertex AI (GCP).
- Host: GitHub
- URL: https://github.com/sayakpaul/dual-deployments-on-vertex-ai
- Owner: sayakpaul
- License: apache-2.0
- Created: 2021-07-29T07:47:51.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-12-28T09:04:31.000Z (almost 3 years ago)
- Last Synced: 2024-10-03T12:38:36.958Z (about 1 month ago)
- Topics: automl, gcp, keras, kubeflow, mlops, tensorflow, tfx, vertex-ai
- Language: Jupyter Notebook
- Homepage: https://cloud.google.com/blog/topics/developers-practitioners/dual-deployments-vertex-ai
- Size: 1.84 MB
- Stars: 35
- Watchers: 6
- Forks: 8
- Open Issues: 0
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
# Dual-Deployments-on-Vertex-AI
_By [Chansung Park](https://github.com/deep-diver) and Sayak Paul_
This project demonstrates a workflow to cover dual model deployment scenarios using [Kubeflow](https://www.kubeflow.org/),
[TensorFlow Extended (TFX)](https://www.tensorflow.org/tfx), and [Vertex AI](https://cloud.google.com/vertex-ai). We suggest
reading the accompanying [blog post](https://cloud.google.com/blog/topics/developers-practitioners/dual-deployments-vertex-ai) first
to get an idea and then following along with the code. This project also received the [#TFCommunitySpotlight Award](https://twitter.com/TensorFlow/status/1446611368078086144?s=20).

## Motivation 💻
Let's say you want to allow your users to run an application in both online and offline modes. When network
bandwidth or battery is limited, your mobile application would use an on-device TFLite model; when sufficient
connectivity is available, it would instead call the larger model hosted in the cloud. This way
your application stays resilient and can ensure high availability.

Sometimes we also do layered predictions, where we first divide a problem into smaller tasks:
1) make a coarse yes/no prediction,
2) depending on the output of 1), run the final model.

In these cases, 1) takes place on-device and 2) takes place in the cloud to ensure a smooth UX. Furthermore, it's
a good practice to use a mobile-friendly network architecture (such as MobileNet) when considering
mobile deployments. This leads us to the following question:

_**Can we train two different models within the same deployment pipeline and manage them seamlessly?**_
This project is motivated by this question.
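As a purely illustrative sketch (not code from this repository), the dual setup boils down to a client that prefers the cloud endpoint when connectivity allows and falls back to an on-device TFLite model otherwise. A real mobile app would run the on-device part with the TFLite runtime in Kotlin/Swift; the model path and endpoint name below are placeholders.

```python
# Illustrative only: choose between the on-device TFLite model and the
# cloud-hosted Vertex AI endpoint. Paths/IDs are placeholders.
import numpy as np
import tensorflow as tf
from google.cloud import aiplatform


def predict_on_device(image: np.ndarray, model_path: str = "model.tflite"):
    """Run the lightweight TFLite model locally (offline / low-bandwidth path)."""
    interpreter = tf.lite.Interpreter(model_path=model_path)
    interpreter.allocate_tensors()
    input_details = interpreter.get_input_details()
    output_details = interpreter.get_output_details()
    interpreter.set_tensor(input_details[0]["index"], image.astype(np.float32))
    interpreter.invoke()
    return interpreter.get_tensor(output_details[0]["index"])


def predict_on_cloud(image: np.ndarray, endpoint_name: str):
    """Call the larger model deployed on a Vertex AI endpoint (online path)."""
    endpoint = aiplatform.Endpoint(endpoint_name)  # e.g. "projects/.../endpoints/123"
    return endpoint.predict(instances=[image.tolist()]).predictions


def predict(image: np.ndarray, online: bool, endpoint_name: str):
    # Fall back to the on-device model when connectivity is poor or absent.
    if online:
        return predict_on_cloud(image, endpoint_name)
    return predict_on_device(image)
```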
## AutoML, TFX, etc. 🛠
Different organizations have people with varied technical backgrounds. We wanted to provide the easiest solution first
and then move on to something more customizable. To this end, we leverage [Kubeflow's AutoML SDKs](https://github.com/kubeflow/pipelines/tree/master/components/google-cloud) to build, train, and deploy models for
different production use cases. With AutoML, developers can delegate a large part of their workflows to the SDKs,
and the codebase also stays comparatively small. The figure below depicts a sample system architecture for
this scenario:

![](figures/sample_architecture.png)
**Figure developed by Chansung Park.**
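To give a feel for the AutoML path, here is a minimal sketch of such a pipeline written with KFP and `google-cloud-pipeline-components`. It is not the project's exact pipeline: component module paths and argument names vary across library versions, and the project, bucket, and display names below are placeholders.

```python
# Rough sketch of an AutoML-based Vertex AI pipeline: create a managed image
# dataset, launch an AutoML training job, and deploy the model to an endpoint.
# NOTE: exact component paths/arguments differ between versions of
# google-cloud-pipeline-components; values below are placeholders.
from kfp.v2 import dsl
from google.cloud import aiplatform
from google_cloud_pipeline_components import aiplatform as gcc_aip

PROJECT_ID = "my-gcp-project"  # placeholder
REGION = "us-central1"         # placeholder


@dsl.pipeline(name="dual-deployment-automl")
def automl_pipeline(gcs_source: str = "gs://my-bucket/flowers.csv"):
    dataset_op = gcc_aip.ImageDatasetCreateOp(
        project=PROJECT_ID,
        display_name="flowers",
        gcs_source=gcs_source,
        import_schema_uri=aiplatform.schema.dataset.ioformat.image.single_label_classification,
    )

    training_op = gcc_aip.AutoMLImageTrainingJobRunOp(
        project=PROJECT_ID,
        display_name="train-flowers-automl",
        prediction_type="classification",
        model_type="CLOUD",
        dataset=dataset_op.outputs["dataset"],
        budget_milli_node_hours=8000,
    )

    endpoint_op = gcc_aip.EndpointCreateOp(
        project=PROJECT_ID,
        display_name="flowers-endpoint",
    )

    # Deployment arguments in particular have changed across library versions.
    gcc_aip.ModelDeployOp(
        model=training_op.outputs["model"],
        endpoint=endpoint_op.outputs["endpoint"],
    )
```

In the actual project, this basic flow is extended so that a second, mobile-friendly model is also produced and exported (e.g., to TFLite) alongside the cloud-deployed one.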
But the story does not end here. What if we wanted to have better control over the models to be built, trained,
and deployed? Enter TFX! TFX provides the flexibility of writing custom components and including them inside a
pipeline. This way Machine Learning Engineers can focus on building and training their favorite models and delegate
a part of the heavy lifting to TFX and Vertex AI. On Vertex AI (acting as an orchestrator) this pipeline will look like
so:

![](https://i.ibb.co/98Ry74n/Screen-Shot-2021-08-06-at-1-43-35-AM.png)
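As an example of the kind of custom component TFX makes possible, here is a simplified, hypothetical Python-function component that converts a trained SavedModel to TFLite for the mobile deployment path. It is not one of the project's actual components; see [`custom_components`](https://github.com/sayakpaul/Dual-Deployments-on-Vertex-AI/tree/main/custom_components) for those.

```python
# Hypothetical TFX Python-function component: convert a trained SavedModel to
# TFLite so it can be shipped to mobile clients. Simplified for illustration.
import os

import tensorflow as tf
from tfx.dsl.component.experimental.annotations import InputArtifact, OutputArtifact
from tfx.dsl.component.experimental.decorators import component
from tfx.types.standard_artifacts import Model


@component
def TFLiteConverter(
    trained_model: InputArtifact[Model],
    tflite_model: OutputArtifact[Model],
):
    # "Format-Serving" is the assumed layout of the Trainer's output artifact.
    saved_model_dir = os.path.join(trained_model.uri, "Format-Serving")
    converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
    tflite_bytes = converter.convert()

    # Write the converted model into the output artifact's URI.
    os.makedirs(tflite_model.uri, exist_ok=True)
    with open(os.path.join(tflite_model.uri, "model.tflite"), "wb") as f:
        f.write(tflite_bytes)
```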
```txt
🔥 In this project we cover both these situations.
```

## Code 🆘
Our code is distributed as Colab Notebooks, but one needs a billing-enabled GCP account
(with a few APIs enabled) to run them successfully. Alternatively, one can run the
notebooks on [Vertex AI Notebooks](https://cloud.google.com/vertex-ai/docs/general/notebooks). Find
all the notebooks and their descriptions here:
[`notebooks`](https://github.com/sayakpaul/Dual-Deployments-on-Vertex-AI/tree/main/notebooks).

Additionally, you can find the custom TFX components separately here: [`custom_components`](https://github.com/sayakpaul/Dual-Deployments-on-Vertex-AI/tree/main/custom_components).
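For orientation, submitting such a pipeline to Vertex AI Pipelines generally boils down to the following minimal sketch (project, region, and bucket values are placeholders; the notebooks contain the authoritative setup):

```python
# Minimal sketch: compile a KFP v2 pipeline definition and run it on
# Vertex AI Pipelines. Project, region, and bucket values are placeholders.
from kfp.v2 import compiler
from google.cloud import aiplatform

PROJECT_ID = "my-gcp-project"                   # placeholder
REGION = "us-central1"                          # placeholder
PIPELINE_ROOT = "gs://my-bucket/pipeline-root"  # placeholder

# `automl_pipeline` is the pipeline function sketched earlier; a TFX pipeline
# would instead be compiled to the same JSON format with KubeflowV2DagRunner.
compiler.Compiler().compile(
    pipeline_func=automl_pipeline,
    package_path="dual_deployment_pipeline.json",
)

aiplatform.init(project=PROJECT_ID, location=REGION)
job = aiplatform.PipelineJob(
    display_name="dual-deployment-pipeline",
    template_path="dual_deployment_pipeline.json",
    pipeline_root=PIPELINE_ROOT,
)
job.submit()
```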
## Acknowledgements
We are grateful to the [ML-GDE program](https://developers.google.com/programs/experts/) for providing GCP credits, and to [Karl Weinmeister](https://twitter.com/kweinmeister?lang=hr) and [Robert Crowe](https://twitter.com/robert_crowe?lang=en) for their review feedback on this project.