https://github.com/baggepinnen/differentialdynamicprogramming.jl

A package for solving Differential Dynamic Programming and trajectory optimization problems.

ddp dynamic-programming model-predictive-control optimal-control trajectory-optimization

Host: GitHub
URL: https://github.com/baggepinnen/differentialdynamicprogramming.jl
Owner: baggepinnen
License: other
Created: 2016-03-01T10:42:43.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2021-05-13T19:34:37.000Z (about 4 years ago)
Last Synced: 2025-02-22T12:06:18.357Z (5 months ago)
Topics: ddp, dynamic-programming, model-predictive-control, optimal-control, trajectory-optimization
Language: Julia
Size: 271 KB
Stars: 76
Watchers: 9
Forks: 15
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE.md

README

# DifferentialDynamicProgramming

[![Build Status](https://travis-ci.org/baggepinnen/DifferentialDynamicProgramming.jl.svg?branch=master)](https://travis-ci.org/baggepinnen/DifferentialDynamicProgramming.jl)

[![Coverage Status](https://coveralls.io/repos/github/baggepinnen/DifferentialDynamicProgramming.jl/badge.png?branch=master)](https://coveralls.io/github/baggepinnen/DifferentialDynamicProgramming.jl?branch=master)

## Installation

The package is registered and can be added with  

`] add DifferentialDynamicProgramming`  

The latest version is formally compatible with Julia v1.1+ (but it probably works well on Julia v1.0 too if you `dev` it).
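The same installation can also be done from a script through the `Pkg` standard library. This is a minimal sketch that assumes network access to the General registry. The commented `Pkg.develop` line is the programmatic counterpart of `dev` and is only needed if you want a local checkout.

```julia
using Pkg

# Equivalent to `] add DifferentialDynamicProgramming` at the REPL
Pkg.add("DifferentialDynamicProgramming")

# Alternative: check out a development copy instead (equivalent to `] dev ...`)
# Pkg.develop("DifferentialDynamicProgramming")

using DifferentialDynamicProgramming
```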

## Demo functions

The following demo functions are provided:

`demo_linear()` runs the iLQG DDP algorithm on a simple linear problem.  

`demoQP` solves a demo quadratic program.  

`demo_pendcart()` simulates a pendulum attached to a cart.

## Usage

### Demo linear

See demo file `demo_linear.jl` for a usage example.

```julia
using LinearAlgebra
using DifferentialDynamicProgramming

# make stable linear dynamics
h = .01         # time step
n = 10          # state dimension
m = 2           # control dimension
A = randn(n,n)
A = A-A'        # skew-symmetric = pure imaginary eigenvalues
A = exp(h*A)    # discrete time
B = h*randn(n,m)

# quadratic costs (`eye` was removed in Julia 1.0, so build identities from `I`)
Q    = h*Matrix{Float64}(I,n,n)
R    = .1*h*Matrix{Float64}(I,m,m)

# control limits
lims = [] #ones(m,1)*[-1 1]*.6

T    = 1000             # horizon
x0   = ones(n,1)        # initial state
u0   = .1*randn(m,T)    # initial controls

# optimization problem
N    = T+1
fx   = A
fu   = B
cxx  = Q
cxu  = zeros(size(B))
cuu  = R

# Specify dynamics functions
function lin_dyn_df(x,u,Q,R)
    u[isnan.(u)] .= 0   # replace NaN controls with zero
    cx  = Q*x
    cu  = R*u
    fxx=fxu=fuu = []    # linear dynamics: no second-order terms
    return fx,fu,fxx,fxu,fuu,cx,cu,cxx,cxu,cuu
end

function lin_dyn_f(x,u,A,B)
    u[isnan.(u)] .= 0   # replace NaN controls with zero
    xnew = A*x + B*u
    return xnew
end

function lin_dyn_cost(x,u,Q,R)
    c = 0.5*sum(x.*(Q*x)) + 0.5*sum(u.*(R*u))
    return c
end

f(x,u,i)     = lin_dyn_f(x,u,A,B)
costfun(x,u) = lin_dyn_cost(x,u,Q,R)
df(x,u)      = lin_dyn_df(x,u,Q,R)

# run the optimization
@time x, u, L, Vx, Vxx, cost, otrace = iLQG(f, costfun ,df, x0, u0, lims=lims);
```
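The problem setup can be checked without calling the package at all. The sketch below uses only the standard library. It confirms that the matrix exponential of a skew-symmetric matrix has all eigenvalues on the unit circle, then rolls out the initial controls and sums the quadratic cost. The helper `rollout`, the shorter horizon, and the terminal state cost are assumptions made for illustration and are not part of the package.

```julia
using LinearAlgebra, Random

Random.seed!(0)
h, n, m, T = 0.01, 10, 2, 100      # shorter horizon than the demo, for speed
S = randn(n, n)
S = S - S'                         # skew-symmetric: purely imaginary eigenvalues
A = exp(h * S)                     # matrix exponential, orthogonal for skew-symmetric S
B = h * randn(n, m)
Q = h * Matrix{Float64}(I, n, n)
R = 0.1 * h * Matrix{Float64}(I, m, m)

# Hypothetical helper: simulate x[t+1] = A*x[t] + B*u[t] and accumulate the cost.
function rollout(A, B, Q, R, x0, u)
    n, T = length(x0), size(u, 2)
    x = zeros(n, T + 1)
    x[:, 1] = x0
    cost = 0.0
    for t in 1:T
        cost += 0.5 * dot(x[:, t], Q * x[:, t]) + 0.5 * dot(u[:, t], R * u[:, t])
        x[:, t + 1] = A * x[:, t] + B * u[:, t]
    end
    cost += 0.5 * dot(x[:, T + 1], Q * x[:, T + 1])   # assumed terminal state cost
    return x, cost
end

x0 = ones(n)
u0 = 0.1 * randn(m, T)
x, cost = rollout(A, B, Q, R, x0, u0)

eigmags = abs.(eigvals(A))                             # should all be close to 1
x_free, _ = rollout(A, B, Q, R, x0, zeros(m, T))       # zero control preserves the state norm
```

Because `A` is orthogonal, the uncontrolled state neither decays nor grows, so any cost reduction has to come from the controls the optimizer finds.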

### Demo pendulum on cart

There is an additional demo function `demo_pendcart()`, where a pendulum attached to a cart is simulated. In this example, regular LQG control fails to stabilize the pendulum at the upright position due to control limitations. The DDP-based optimization solves this by letting the pendulum fall and increasing the energy in the pendulum during the fall, such that it will stay upright after one revolution.

![window](images/states_pendcart.png)

![window](images/control_pendcart.png)
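The idea of adding energy under a control limit can be illustrated on a much simpler system. The sketch below is not the package's cart-pendulum model and does not use DDP. It is a single pendulum with a hypothetical torque limit `umax` that is too small to lift the pendulum statically, driven by a hand-written rule that always pushes in the direction of motion. All parameter values and the names `energy` and `pump_energy` are assumptions.

```julia
# Simplified single pendulum without a cart; theta = 0 is hanging straight down.
# All parameter values are hypothetical.
energy(theta, omega; mass = 1.0, len = 1.0, g = 9.81) =
    0.5 * mass * len^2 * omega^2 + mass * g * len * (1 - cos(theta))

function pump_energy(theta, omega; umax = 2.0, mass = 1.0, len = 1.0, g = 9.81,
                     dt = 1e-3, steps = 20_000)
    E_upright = 2 * mass * g * len          # energy at rest in the upright position
    for _ in 1:steps
        # stop pumping once the pendulum has enough energy to reach the top
        energy(theta, omega; mass = mass, len = len, g = g) >= E_upright && break
        u = omega >= 0 ? umax : -umax       # saturated torque in the direction of motion
        omega += dt * (-(g / len) * sin(theta) + u / (mass * len^2))
        theta += dt * omega                 # semi-implicit Euler step
    end
    return theta, omega
end

theta0, omega0 = 0.1, 0.0
theta1, omega1 = pump_energy(theta0, omega0)
E0 = energy(theta0, omega0)
E1 = energy(theta1, omega1)                 # larger than E0: the swings have added energy
```

Since the torque always acts along the velocity, the injected power `u * omega` is never negative, so the energy builds up over successive swings even though the limit rules out a direct lift.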

# Citing

This code consists of a port and extensions of a MATLAB library provided by the authors of

```
BIBTeX:
@INPROCEEDINGS{
  author    = {Tassa, Y. and Mansard, N. and Todorov, E.},
  booktitle = {Robotics and Automation (ICRA), 2014 IEEE International Conference on},
  title     = {Control-Limited Differential Dynamic Programming},
  year      = {2014}, month={May}, doi={10.1109/ICRA.2014.6907001}}
  http://www.mathworks.com/matlabcentral/fileexchange/52069-ilqg-ddp-trajectory-optimization
  http://www.cs.washington.edu/people/postdocs/tassa/
```

The code above was extended with KL-divergence constrained optimization for the thesis

[Bagge Carlson, F.](https://www.control.lth.se/staff/fredrik-bagge-carlson/), ["Machine Learning and System Identification for Estimation in Physical Systems"](https://lup.lub.lu.se/search/publication/ffb8dc85-ce12-4f75-8f2b-0881e492f6c0) (PhD Thesis 2018).

```bibtex
@thesis{bagge2018,
  title        = {Machine Learning and System Identification for Estimation in Physical Systems},
  author       = {Bagge Carlson, Fredrik},
  keyword      = {Machine Learning,System Identification,Robotics,Spectral estimation,Calibration,State estimation},
  month        = {12},
  type         = {PhD Thesis},
  number       = {TFRT-1122},
  institution  = {Dept. Automatic Control, Lund University, Sweden},
  year         = {2018},
  url          = {https://lup.lub.lu.se/search/publication/ffb8dc85-ce12-4f75-8f2b-0881e492f6c0},
}
```
