Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/lazauk/azureml-featurestore-cli
Tutorial on setup of online and offline feature stores in Azure Machine Learning (Azure ML), using command line interface (Az CLI)
https://github.com/lazauk/azureml-featurestore-cli
adls ai azure feature-store machine-learning offline online redis
Last synced: 17 days ago
JSON representation
Tutorial on setup of online and offline feature stores in Azure Machine Learning (Azure ML), using command line interface (Az CLI)
- Host: GitHub
- URL: https://github.com/lazauk/azureml-featurestore-cli
- Owner: LazaUK
- License: mit
- Created: 2024-09-05T19:24:35.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-09-11T00:42:00.000Z (4 months ago)
- Last Synced: 2024-11-07T13:42:11.627Z (about 2 months ago)
- Topics: adls, ai, azure, feature-store, machine-learning, offline, online, redis
- Homepage:
- Size: 328 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Azure Machine Learning - Feature Store (CLI Setup)
Ready to boost your machine learning projects? Azure Machine Learning (Azure ML) offers a centralised repository called _**Feature Store**_ to simplify feature engineering. It lets you store, manage and efficiently reuse features across projects, saving time and efforts.
This guide will show you how to set up an Azure ML Feature Store with online and offline materialisation using the **Azure CLI** (Az CLI). The online store is powered by the in-memory capacity of _Azure Redis Cache_, while the offline store utilises the 2nd generation of Azure Data Lake Store (_ADLS Gen2_).
Here's what you'll need:
- An Azure subscription,
- Az CLI installed (we'll cover its ML extension installation in the first step).## Table of contents:
- [Step 1: Installing Az CLI's ML extension](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-1-installing-az-clis-ml-extension)
- [Step 2: Creating ADLS Gen2 storage account](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-2-creating-adls-gen2-storage-account)
- [Step 3: Creating container on ADLS Gen2 storage](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-3-creating-container-on-adls-gen2-storage)
- [Step 4: Creating Redis Cache instance](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-4-creating-redis-cache-instance)
- [Step 5: Creating user-assigned Managed Identity](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-5-creating-user-assigned-managed-identity)
- [Step 6: Creating Azure ML feature store](https://github.com/LazaUK/AzureML-FeatureStore-CLI#step-6-creating-azure-ml-feature-store)
- [HOUSEKEEPING: Deleting feature store](https://github.com/LazaUK/AzureML-FeatureStore-CLI#housekeeping-deleting-feature-store)## Step 1: Installing Az CLI's ML extension
To interact with Azure ML through Az CLI, you need to install the ML extension:
``` Bash
az extension add --name ml
```## Step 2: Creating ADLS Gen2 storage account
Now create an ADLS Gen2 account, that will be used as the default storage account by your feature store:
``` Bash
az storage account create --name --enable-hierarchical-namespace true --resource-group --location --subscription
```
> Note: Replace ``````, ``````, `````` and `````` with required Storage account values.## Step 3: Creating container on ADLS Gen2 storage
Once you've created a Storage account, you can setup a blob container that will be used by the feature store for offline materialisation:
``` Bash
az storage fs create --name --account-name --subscription --connection-string
```
> Note: Replace ``````, ``````, `````` and `````` with required Storage account's container values.## Step 4: Creating Redis Cache instance
If you want your ML models to access features with low latency, create a Redis Cache instance for online materialisation:
``` Bash
az redis create --name --resource-group --location --sku --vm-size
```
> Note: Replace ``````, ``````, ``````, `````` and `````` with required Redis Cache resource's values.## Step 5: Creating user-assigned Managed Identity
Azure can automatically create a managed identity for your feature store. Alternatively, you can pre-provision your own, e.g to follow corporate naming conventions:
```
az identity create --name --resource-group --location --subscription
```
> Note: Replace ``````, ``````, `````` and `````` with required managed identity's values.## Step 6: Creating Azure ML feature store
The last step is to create the feature store itself.1. Download the provided [YAML template](./FeatureStore_Online_Offline.yaml) and update it with details from your previous steps (storage, Redis Cache and managed identity):
``` YAML
$schema: http://azureml/sdk-2-0/FeatureStore.json# General configuration of Azure ML feature store
name: feature-store-online-offline
display_name: "Feature store with both online and offline materialisation"
resource_group:
location:
tags:
author: Laziz_Turakulov# Apache Spark configuration
compute_runtime:
spark_runtime_version: "3.4"# Details of user-assigned managed identity
materialization_identity:
client_id: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
principal_id: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
resource_id: "/subscriptions//resourceGroups//providers/Microsoft.ManagedIdentity/userAssignedIdentities/"# Details of Azure Redis Cache (online materialisation)
online_store:
type: redis
target: "/subscriptions//resourceGroups//providers/Microsoft.Cache/Redis/"# Details of ADLS Gen2 Storage account (offline materialisation)
offline_store:
type: azure_data_lake_gen2
target: "/subscriptions//resourceGroups//providers/Microsoft.Storage/storageAccounts//blobServices/default/containers/"
# Details of default Storage account
storage_account: "/subscriptions//resourceGroups//providers/Microsoft.Storage/storageAccounts/"
```
2. Then run the following Az CLI command to create the feature store:
``` Bash
az ml feature-store create --resource-group --file FeatureStore_Online_Offline.yaml
```
3. Verify successful creation by checking the Azure ML Studio UI for the newly created feature store.
![AzureML_FeatureStore_Home](images/AzureML_FeatureStore_Home.png)
4. Your managed identity should have been assigned the following roles:
- ```AzureML Data Scientist``` to Azure ML feature store,
- ```Storage Blob Data Contributor``` to default Storage account,
- ```Storage Blob Data Contributor``` to offline store's Blob container,
- ```Contributor``` to online store's Redis Cache resource.
![AzureML_FeatureStore_MI](images/AzureML_FeatureStore_MI.png)## HOUSEKEEPING: Deleting feature store
While the Azure ML Studio UI currently doesn't support deleting feature stores directly, you can use the following Az CLI command:
``` Bash
az ml feature-store delete --name --resource-group --all-resources
```