https://github.com/dragen1860/maml-pytorch-rl
- Host: GitHub
- URL: https://github.com/dragen1860/maml-pytorch-rl
- Owner: dragen1860
- License: MIT
- Created: 2018-08-04T08:41:41.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2023-06-16T16:30:12.000Z (over 2 years ago)
- Last Synced: 2025-05-07T08:08:45.124Z (5 months ago)
- Language: Python
- Size: 4.65 MB
- Stars: 31
- Watchers: 3
- Forks: 11
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
# Reinforcement Learning with Model-Agnostic Meta-Learning (MAML)

Implementation of Model-Agnostic Meta-Learning (MAML) applied to reinforcement learning problems in PyTorch. This repository includes the environments introduced in [Duan et al., 2016](https://arxiv.org/abs/1611.02779) and [Finn et al., 2017](https://arxiv.org/abs/1703.03400): multi-armed bandits, tabular MDPs, continuous control with MuJoCo, and a 2D navigation task.
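
To make the structure of the algorithm concrete, here is a minimal, self-contained sketch of the two-level loop that MAML adds on top of a policy-gradient learner: an inner loop that adapts the policy to a sampled task with one gradient step, and an outer loop that differentiates through that adaptation. It runs on a toy 2D navigation task (random goal, reward equal to the negative distance to the goal) and is an illustration only, not this repository's code: the repository's flags (`--max-kl`, `--cg-damping`, `--ls-max-steps`) indicate a TRPO-style meta-update, whereas the sketch uses plain REINFORCE with Adam.
```
# Minimal second-order MAML for RL on a toy 2D navigation task (illustrative only).
import torch
import torch.nn.functional as F

def policy_mean(params, state):
    # Small two-layer network producing the mean of a Gaussian policy.
    h = torch.tanh(F.linear(state, params[0], params[1]))
    return F.linear(h, params[2], params[3])

def surrogate_loss(params, goal, horizon=10, action_std=0.1):
    # REINFORCE surrogate on one trajectory: the agent starts at the origin
    # and receives reward -||state - goal|| after every step.
    state = torch.zeros(2)
    log_probs, rewards = [], []
    for _ in range(horizon):
        dist = torch.distributions.Normal(policy_mean(params, state), action_std)
        action = dist.sample()
        log_probs.append(dist.log_prob(action).sum())
        state = state + action
        rewards.append(-torch.linalg.norm(state - goal))
    # Rewards-to-go serve as the return estimate for each step.
    returns = torch.cumsum(torch.stack(rewards).flip(0), 0).flip(0)
    return -(torch.stack(log_probs) * returns).mean()

# Meta-parameters theta: the initialization that MAML learns.
params = [p.requires_grad_() for p in
          (0.1 * torch.randn(32, 2), torch.zeros(32),
           0.1 * torch.randn(2, 32), torch.zeros(2))]
meta_optimizer = torch.optim.Adam(params, lr=1e-3)
fast_lr, meta_batch_size = 0.1, 20

for iteration in range(500):
    meta_optimizer.zero_grad()
    for _ in range(meta_batch_size):
        goal = 2 * torch.rand(2) - 1  # sample a task: a random goal in [-1, 1]^2
        # Inner loop: theta' = theta - alpha * grad(L_task(theta)).
        # create_graph=True keeps this step differentiable.
        grads = torch.autograd.grad(surrogate_loss(params, goal), params,
                                    create_graph=True)
        adapted = [p - fast_lr * g for p, g in zip(params, grads)]
        # Outer loop: evaluate the adapted policy on a fresh trajectory from
        # the same task; backpropagation reaches theta through the inner step.
        (surrogate_loss(adapted, goal) / meta_batch_size).backward()
    meta_optimizer.step()
```
The `create_graph=True` flag is what separates MAML from plain fine-tuning: the meta-gradient accounts for how changing the initialization changes the outcome of adaptation.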
## Getting started
To avoid conflicts with your existing Python setup, and to keep this project self-contained, we suggest working in a virtual environment created with [`virtualenv`](http://docs.python-guide.org/en/latest/dev/virtualenvs/). To install `virtualenv`:
```
pip install --upgrade virtualenv
```
Create a virtual environment, activate it, and install the requirements listed in [`requirements.txt`](requirements.txt):
```
virtualenv venv
source venv/bin/activate
pip install -r requirements.txt
```

## Usage
You can use the [`main.py`](main.py) script to run reinforcement learning experiments with MAML. The script was tested with Python 3.5; all experiments except the MuJoCo-based environments may also work with Python 2.7.
```
python main.py --env-name HalfCheetahDir-v1 --num-workers 8 --fast-lr 0.1 --max-kl 0.01 --fast-batch-size 20 --meta-batch-size 40 --num-layers 2 --hidden-size 100 --num-batches 1000 --gamma 0.99 --tau 1.0 --cg-damping 1e-5 --ls-max-steps 15 --output-folder maml-halfcheetah-dir --device cuda
```
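
In the command above, `--fast-lr` is the inner-loop (adaptation) step size and `--fast-batch-size` the number of trajectories collected per task, while `--meta-batch-size` is the number of tasks per meta-update and `--num-batches` the number of meta-updates; `--gamma` and `--tau` are the discount and GAE parameters, and `--max-kl`, `--cg-damping`, and `--ls-max-steps` configure the TRPO-style meta-optimization step (following the conventions of the original cbfinn/maml_rl-style codebases). For a quick smoke test without MuJoCo, you can try the 2D navigation task with smaller batches; the environment name below assumes the `2DNavigation-v0` registration used in those codebases:
```
python main.py --env-name 2DNavigation-v0 --num-workers 2 --fast-lr 0.1 --fast-batch-size 10 --meta-batch-size 20 --num-batches 100 --output-folder maml-2dnav --device cpu
```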

## References
This project is, for the most part, a PyTorch reproduction of the original implementation [cbfinn/maml_rl](https://github.com/cbfinn/maml_rl/). The experiments are based on the paper:
> Chelsea Finn, Pieter Abbeel, and Sergey Levine. Model-agnostic meta-learning for fast adaptation of deep networks. _International Conference on Machine Learning (ICML)_, 2017. [[ArXiv](https://arxiv.org/abs/1703.03400)]

If you want to cite this paper:
```
@article{DBLP:journals/corr/FinnAL17,
  author  = {Chelsea Finn and Pieter Abbeel and Sergey Levine},
  title   = {Model-{A}gnostic {M}eta-{L}earning for {F}ast {A}daptation of {D}eep {N}etworks},
  journal = {International Conference on Machine Learning (ICML)},
  year    = {2017},
  url     = {http://arxiv.org/abs/1703.03400}
}
```