https://github.com/nvidia-merlin/gcp-ml-ops
MLOps pipeline for NVIDIA Merlin on GKE
https://github.com/nvidia-merlin/gcp-ml-ops
Last synced: about 1 year ago
JSON representation
MLOps pipeline for NVIDIA Merlin on GKE
- Host: GitHub
- URL: https://github.com/nvidia-merlin/gcp-ml-ops
- Owner: NVIDIA-Merlin
- License: apache-2.0
- Created: 2021-04-14T08:18:39.000Z (about 5 years ago)
- Default Branch: main
- Last Pushed: 2021-06-10T21:40:22.000Z (about 5 years ago)
- Last Synced: 2025-04-18T00:58:03.320Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 479 KB
- Stars: 41
- Watchers: 5
- Forks: 6
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Merlin - MLOps on GKE
## Introduction
[NVIDIA Merlin](https://developer.nvidia.com/nvidia-merlin) is an open-source application framework that facilitates the development and deployment of large-scale deep recommender systems on GPUs.
The figure below shows the architecture of a recommendation system example using NVIDIA Merlin on a [Kubeflow pipeline](https://www.kubeflow.org/docs/components/pipelines/overview/pipelines-overview/).
Through this, we intend to show an end-to-end reference architecture, all the way from data preparation, to model deployment, with features like continuous and fast re-training, autoscaling, and model monitoring.

For this example, we use the [Criteo 1TB Click Logs](https://ailab.criteo.com/download-criteo-1tb-click-logs-dataset/) dataset, a large publicly available dataset for recommender systems. It contains feature values and click feedback for millions of display ads. It is divided into 24 files, each one corresponding to one day of data.
## Running the example
Please follow the User Guide available [here](https://docs.google.com/document/d/1P_BerGSP5CNzGjGbRqgMrPcNaCmQuKUyodFaG0jlu9I/edit?usp=sharing)!