https://github.com/paulescu/backfill-feature-store-with-prefect

Backfill historical OHLC feature in a Feature Store (Hopsworks) using an orchestration tool (Prefect).
backfill dataengineering hopsworks machine-learning ml mlops prefect

# Feature backfilling

Backfill a feature group using Prefect and Hopsworks

#### Table of contents
1. [What is this repo about?](#what-is-this-repo-about)
2. [How to run this code](#how-to-run-this-code)
3. [Wanna build real-world ML products?](#wannna-build-real-world-ml-products)

## What is this repo about?
This repository shows how to backfill historical OHLC (open, high, low, close) features into a feature store (Hopsworks) using an orchestration tool (Prefect).
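
To make the term concrete, here is a minimal sketch of how raw trades could be aggregated into OHLC candles. It is an illustration only, not the code in this repo: the `Candle` class, the `trades_to_ohlc` function, and the one-hour default window are all assumptions.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Candle:
    """One OHLC candle for a fixed time window (hypothetical structure)."""
    window_start: datetime
    open: float
    high: float
    low: float
    close: float


def trades_to_ohlc(trades, window_seconds=3600):
    """Group (timestamp, price) trades into fixed-size OHLC candles.

    `trades` is an iterable of (datetime, float) pairs.
    Timestamps are assumed to be timezone-aware.
    """
    candles = {}
    # Process trades in time order so that open/close are correct.
    for ts, price in sorted(trades, key=lambda t: t[0]):
        # Floor the timestamp to the start of its window.
        epoch = int(ts.timestamp())
        start = datetime.fromtimestamp(epoch - epoch % window_seconds, tz=timezone.utc)
        candle = candles.get(start)
        if candle is None:
            # First trade in the window sets all four values.
            candles[start] = Candle(start, price, price, price, price)
        else:
            candle.high = max(candle.high, price)
            candle.low = min(candle.low, price)
            candle.close = price  # last trade seen so far in the window
    return [candles[start] for start in sorted(candles)]
```

Each resulting candle corresponds to one row that a backfill job would write to the feature group.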

## How to run this code

- Create a Python virtual environment and install the project dependencies
```
$ make init
```

- Log in to your Prefect Cloud account
```
$ prefect cloud login
```

- Set the environment variables needed to connect to your Hopsworks feature store
```
$ . ./set_hopsworks_credentials.sh
```

- Backfill OHLC data for a range of dates (e.g. from `2023-01-01` to `2023-01-31`)
```
$ make from_day=2023-01-01 to_day=2023-01-31 backfill
```
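
The `from_day` and `to_day` arguments above suggest that the backfill walks a range of days. A minimal sketch of that idea follows, assuming both ends of the range are inclusive and that each day is processed independently. The function names are hypothetical and `backfill_one_day` is a placeholder, not the repo's actual logic.

```python
from datetime import date, timedelta


def days_between(from_day: str, to_day: str) -> list[date]:
    """Return every day from `from_day` to `to_day` (ISO format, both inclusive)."""
    start, end = date.fromisoformat(from_day), date.fromisoformat(to_day)
    if start > end:
        raise ValueError(f"from_day {from_day} is after to_day {to_day}")
    return [start + timedelta(days=i) for i in range((end - start).days + 1)]


def backfill_one_day(day: date) -> str:
    """Hypothetical placeholder for the per-day work:
    fetch raw data for `day`, compute OHLC features, write them to the feature group.
    """
    return f"backfilled {day.isoformat()}"


def backfill(from_day: str, to_day: str) -> list[str]:
    """Run the per-day step once for each day in the range."""
    return [backfill_one_day(day) for day in days_between(from_day, to_day)]


if __name__ == "__main__":
    for line in backfill("2023-01-01", "2023-01-31"):
        print(line)
```

In a Prefect project, functions like `backfill_one_day` and `backfill` would typically be wrapped with the `@task` and `@flow` decorators from the `prefect` package, so that each day shows up as a separate, retryable unit of work. Whether this repo structures its flow that way is not confirmed here.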

## Wanna build real-world ML products?

Check out the [Real-World ML Program](https://realworldmachinelearning.carrd.co/), a hands-on, 3-hour course where you will learn
how to design, build, [deploy](https://taxi-demand-predictor.streamlit.app/), and [monitor](https://taxi-demand-predictor-monitoring.streamlit.app/) complete ML products.