An open API service indexing awesome lists of open source software.

https://github.com/edgeimpulse/example-custom-ml-block-scikit

Scikit learn block
https://github.com/edgeimpulse/example-custom-ml-block-scikit

Last synced: 11 months ago
JSON representation

Scikit learn block

Awesome Lists containing this project

README

          

# Custom scikit-learn ML block example for Edge Impulse

This repository is an example on how to [add a custom learning block](https://docs.edgeimpulse.com/docs/adding-custom-transfer-learning-models) to Edge Impulse. This repository contains a small logistic regression model built with scikit-learn.

NOTE: Only the following scikit-learn models are supported via the pickle conversion workflow presented below:
- [Support vector Classification](https://scikit-learn.org/1.5/modules/generated/sklearn.svm.SVC.html) with linear kernel
- [Linear regression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html)
- [Ridge](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.Ridge.html)
- [RidgeCV](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.RidgeCV.html)
- [Random forest regresssor](https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html)
- [Logistic regression](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html)
- [Ridge classifier](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.RidgeClassifier.html)
- [Ridge classifier CV](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.RidgeClassifierCV.html)
- [Random forest classifier](https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html)

As a primer, read the [Custom learning blocks](https://docs.edgeimpulse.com/docs/edge-impulse-studio/learning-blocks/adding-custom-learning-blocks) page in the Edge Impulse docs.

> **Note on epochs**: You'll want a lot of training cycles (1,000+ or so) here.

## Running the pipeline

You run this pipeline via Docker. This encapsulates all dependencies and packages for you.

### Running via Docker

1. Install [Docker Desktop](https://www.docker.com/products/docker-desktop/).
2. Install the [Edge Impulse CLI](https://docs.edgeimpulse.com/docs/edge-impulse-cli/cli-installation) v1.16.0 or higher.
3. Create a new Edge Impulse project, and add data from the [continuous gestures](https://docs.edgeimpulse.com/docs/continuous-gestures) dataset.
4. Under **Create impulse** add a 'Spectral features' processing block, and a random ML block.
5. Open a command prompt or terminal window.
6. Initialize the block:

```
$ edge-impulse-blocks init
```

7. Fetch new data via:

```
$ edge-impulse-blocks runner --download-data data/
```

8. Build the container:

```
$ docker build -t custom-ml-scikit .
```

9. Run the container to test the script (you don't need to rebuild the container if you make changes):

```
$ docker run --rm -v $PWD:/app custom-ml-scikit --data-directory /app/data --epochs 30 --learning-rate 0.01 --out-directory out/
```

10. This creates a .pkl file in the 'out' directory. This is a pickle file of the sklearn model which can be reinstantiated in python. The file will be picked up in the custom block and converted to a TFLite model within studio.

#### Adding extra dependencies

If you have extra packages that you want to install within the container, add them to `requirements.txt` and rebuild the container.

#### Adding new arguments

To add new arguments, see [Custom learning blocks > Arguments to your script](https://docs.edgeimpulse.com/docs/edge-impulse-studio/learning-blocks/adding-custom-learning-blocks#arguments-to-your-script).

## Fetching new data

To get up-to-date data from your project:

1. Install the [Edge Impulse CLI](https://docs.edgeimpulse.com/docs/edge-impulse-cli/cli-installation) v1.16 or higher.
2. Open a command prompt or terminal window.
3. Fetch new data via:

```
$ edge-impulse-blocks runner --download-data data/
```

## Pushing the block back to Edge Impulse

You can also push this block back to Edge Impulse, that makes it available like any other ML block so you can retrain your model when new data comes in, or deploy the model to device.

1. Push the block:

```
$ edge-impulse-blocks push
```

2. The block is now available under any of your projects via **Create impulse > Add new learning block**.

## Changing the block type (e.g. regression)

If you want to change the block type because you're classifying a different data type, or build a model with a different output format, run:

```
$ rm parameters.json .ei-block-config
$ edge-impulse-blocks init
```

And answer the wizard. This'll create a new parameters.json file.