https://github.com/ossamaahmed/blackbox_mpc

Sampling based Model Predictive Control package for Model-Based RL research
https://github.com/ossamaahmed/blackbox_mpc

derivative-free-optimization model-based-rl model-predictive-control reinforcement-learning sampling-based-planning

Last synced: 10 months ago
JSON representation

Sampling based Model Predictive Control package for Model-Based RL research

Host: GitHub
URL: https://github.com/ossamaahmed/blackbox_mpc
Owner: ossamaAhmed
Created: 2020-10-14T12:36:29.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2020-10-20T11:58:31.000Z (almost 6 years ago)
Last Synced: 2025-07-30T02:01:37.514Z (12 months ago)
Topics: derivative-free-optimization, model-based-rl, model-predictive-control, reinforcement-learning, sampling-based-planning
Language: Python
Homepage:
Size: 43.1 MB
Stars: 55
Watchers: 1
Forks: 7
Open Issues: 4
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

          # BlackBox MPC (Model Predictive Control)

[![license](https://img.shields.io/github/license/DAVFoundation/captain-n3m0.svg?style=flat-square)](https://github.com/DAVFoundation/captain-n3m0/blob/master/LICENSE)

[![GitHub release](https://img.shields.io/github/release/ossamaAhmed/blackbox_mpc/all.svg)](https://github.com/ossamaAhmed/blackbox_mpc/releases)

[![Documentation Status](https://readthedocs.org/projects/blackbox-mpc/badge/?version=latest)](https://blackbox-mpc.readthedocs.io/en/latest/index.html)

[![Maintenance](https://img.shields.io/badge/Maintained%3F-yes-green.svg)](https://github.com/ossamaAhmed/blackbox_mpc/graphs/commit-activity)

[![PR](https://camo.githubusercontent.com/f96261621753dacf526590825b84f87ccb1db0e6/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f5052732d77656c636f6d652d627269676874677265656e2e7376673f7374796c653d666c6174)](https://github.com/ossamaAhmed/blackbox_mpc/pulls)

[![Open Source Love png2](https://camo.githubusercontent.com/60dcf2177b53824e7912a6adfb3ff5e318d14ae4/68747470733a2f2f6261646765732e66726170736f66742e636f6d2f6f732f76312f6f70656e2d736f757263652e706e673f763d313033)](https://github.com/ossamaAhmed/blackbox_mpc)

## Description

This package provides a framework of different derivative-free optimizers (powered by [Tensorflow 2.0.0](https://www.tensorflow.org/)) which can be used in

conjuction with an MPC (model predictive controller) and an analytical/ learned dynamics model 

to control an agent in a gym environment.













| **Derivative-Free Optimizer**                | **BlackBox MPC**              |

| --------------------------- | --------------------------------- |

| Cross-Entropy Method (CEM)            | :heavy_check_mark:                |

| Covariance Matrix Adaptation Evolutionary-Strategy (CMA-ES) | :heavy_check_mark:                |

| Path Intergral Method (PI2)         | :heavy_check_mark:                |

| Particle Swarm Optimizer (PSO)        | :heavy_check_mark:                | 

| Random Search (RandomSearch) | :heavy_check_mark:                |

| Simultaneous Perturbation Stochastic Approximation   (SPSA)   | :heavy_check_mark:                |

The package features other functionalities to aid in model-based reinforcement learning (RL) research such as:

- Parallel implementation of the different optimizers using Tensorflow 2.0

- Loading/ saving system dynamics model.

- Monitoring progress using tensorboard.

- Learning dynamics functions.

- Recording videos.

- A modular and flexible interface design to enable research on different trajectory evaluation methods, optimizers, cost functions, system dynamics network architectures or even training algorithms. 

 







Optimizers references:

- [CEM](http://web.mit.edu/6.454/www/www_fall_2003/gew/CEtutorial.pdf)

- [CMA-ES](https://arxiv.org/pdf/1604.00772.pdf)

- [PI2](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7989202)

- [PSO](https://www.cs.tufts.edu/comp/150GA/homeworks/hw3/_reading6%201995%20particle%20swarming.pdf)

- [SPSA](https://www.jhuapl.edu/SPSA/PDF-SPSA/Spall_Stochastic_Optimization.PDF)

## Iterative MPC



 



## Installation

### Install as a pip package from latest release

```bash

pip install blackbox_mpc

```

### Install from source

```bash

git clone https://github.com/ossamaAhmed/blackbox_mpc.git

cd blackbox_mpc

pip install -e .

```

### To use GPU (recommended for faster inference)

```bash

pip install tensorflow_gpu==2.0.0

```

## Usage

The easiest way to get familiar with the framework is to run through the [tutorials](https://github.com/ossamaAhmed/blackbox_mpc/tree/master/tutorials) provided. An example is shown below:

```python

from blackbox_mpc.policies.mpc_policy import \

    MPCPolicy

from blackbox_mpc.utils.pendulum import PendulumTrueModel, \

    pendulum_reward_function

import gym

env = gym.make("Pendulum-v0")

mpc_policy = MPCPolicy(reward_function=pendulum_reward_function,

                       env_action_space=env.action_space,

                       env_observation_space=env.observation_space,

                       true_model=True,

                       dynamics_function=PendulumTrueModel(),

                       optimizer_name='RandomSearch',

                       num_agents=1)

current_obs = env.reset()

for t in range(200):

    action_to_execute, expected_obs, expected_reward = mpc_policy.act(

        current_obs, t)

    current_obs, reward, _, info = env.step(action_to_execute)

    env.render()

```

## Documentation

An API specification and explanation of the code components can be found [here](https://blackbox-mpc.readthedocs.io/en/latest/).

## Visualize Training







## Authors

blackbox_mpc is work done by [Ossama Ahmed (ETH Zürich)](https://ossamaahmed.github.io/), [Jonas Rothfuss (ETH Zürich)](https://las.inf.ethz.ch/people/jonas-rothfuss) and [Prof. Andreas Krause (ETH Zurich)](https://las.inf.ethz.ch/krausea).

This package was developed at the [Learning and Adaptive Systems Lab](https://causal-world.readthedocs.io/en/latest/index.html) @ETH Zurich.

## If you use the package, please cite blackbox_mpc

```

@misc{blackbox_mpc,

   author = {Ahmed, Ossama and Rothfuss, Jonas and Krause, Andreas},

   year = {2020},

   publisher = {GitHub},

   journal = {GitHub repository},

   howpublished = {\url{https://github.com/ossamaAhmed/blackbox_mpc}},

}

```

## License

The code is licenced under the MIT license and free to use by anyone without any restrictions.

## TODO

- Add bayesian neural networks (BNN) and graph neural networks (GNN) support.

- Add different trajectory evaluators to propagate uncertainities support.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ossamaahmed/blackbox_mpc

Awesome Lists containing this project

README