https://github.com/isl-org/metalearningtradeoffs

Source code for the NeurIPS 2020 Paper: Modeling and Optimization Trade-off in Meta-learning.

Host: GitHub
URL: https://github.com/isl-org/metalearningtradeoffs
Owner: isl-org
License: other
Created: 2020-10-20T18:09:40.000Z (almost 6 years ago)
Default Branch: main
Last Pushed: 2024-06-26T21:23:00.000Z (about 2 years ago)
Last Synced: 2025-04-04T12:12:18.480Z (over 1 year ago)
Language: Python
Size: 53.6 MB
Stars: 4
Watchers: 8
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Modeling and Optimization Trade-off in Meta-learning

This repository contains the code used to obtain the experimental results in the paper [Modeling and Optimization Trade-off in Meta-learning](https://arxiv.org/abs/2010.12916), Gao and Sener (NeurIPS 2020).

It is based on the full_code branch of the [ProMP](https://github.com/jonasrothfuss/ProMP) repository.

The code is written in Python 3. The linear regression experiment requires only [NumPy](https://numpy.org), while the reinforcement learning experiments also require [TensorFlow](https://www.tensorflow.org/) and the [MuJoCo](http://www.mujoco.org/) physics engine.

Some of the reinforcement learning environments can be found in this repository, and the rest are from [MetaWorld](https://github.com/rlworkgroup/metaworld).

## Installation

Please follow the installation instructions provided by the [ProMP](https://github.com/jonasrothfuss/ProMP) repository and the [MetaWorld](https://github.com/rlworkgroup/metaworld) repository. 

For MetaWorld, please use the `api-rework` branch for compatibility (this has already been added to `requirements.txt`).

## Running the experiments

### Linear regression

Execute

```
python3 linear_regression/run_experiment.py --p 1 --beta 2 --seed 1
```

The figures can then be found in the folder `p-1_beta-2_seed-1/figures`.
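
To repeat the run over several seeds, the command above can be wrapped in a small Python helper. This is a minimal sketch, not part of the repository: the helper names `build_command`, `figures_dir`, and `run_sweep` are hypothetical, and the output folder pattern `p-{p}_beta-{beta}_seed-{seed}/figures` is inferred from the single example above rather than confirmed from the code. Which values of `--p`, `--beta`, and `--seed` are valid is likewise an assumption.

```python
import subprocess
from pathlib import Path


def build_command(p, beta, seed):
    """Return the argv list for one run, using the flags shown in the README."""
    return [
        "python3", "linear_regression/run_experiment.py",
        "--p", str(p), "--beta", str(beta), "--seed", str(seed),
    ]


def figures_dir(p, beta, seed):
    """Assumed output location, generalized from `p-1_beta-2_seed-1/figures`."""
    return Path(f"p-{p}_beta-{beta}_seed-{seed}") / "figures"


def run_sweep(seeds, p=1, beta=2, dry_run=True):
    """Build one command per seed; execute them only when dry_run is False."""
    commands = [build_command(p, beta, seed) for seed in seeds]
    if not dry_run:
        for command in commands:
            # check=True stops the sweep as soon as one run fails.
            subprocess.run(command, check=True)
    return commands
```

Calling `run_sweep([1, 2, 3])` only returns the commands, so they can be inspected first; passing `dry_run=False` from the repository root would execute them one after another.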

### Reinforcement learning

To create all the executable scripts that we need to run, execute

```
python3 experiments/benchmark/run.py
```

They will be found in the folder `scripts`.

The training scripts are of the form `algorithm_environment_mode_seed.sh`, and the testing scripts are of the form `test_algorithm_environment_mode_seed_checkpoint.sh`.

- `algorithm` is replaced by one of `ppo` (DRS+PPO), `promp` (ProMP), `trpo` (DRS+TRPO), or `trpomaml` (TRPO-MAML).

- `environment` and `mode` are replaced by one of the following pairs:

  - `walker` and `params-interpolate` (Walker2DRandParams) 

  - `walker` and `goal-interpolate` (Walker2DRandVel)

  - `cheetah` and `goal-interpolate` (HalfCheetahRandVel)

  - `hopper` and `params-interpolate` (HopperRandParams)

  - `metaworld` and `ml1-push` (ML1-Push)

  - `metaworld` and `ml1-reach` (ML1-Reach)

  - `metaworld` and `ml10` (ML10)

  - `metaworld` and `ml45` (ML45)

- `seed`, the random seed, is replaced by an integer from 1 to 5.

- `checkpoint`, which identifies a policy stored at one of several stages during training, is replaced by an integer from 0 to 20.

After all runs are finished, the figures can be created by executing

```
python3 experiments/benchmark/summary.py
```

They will be found in the folder `results`.

## Acknowledgements

We would like to thank Charles Packer for help during the creation of the code for the reinforcement learning experiments.

## Citation

To cite this repository in your research, please reference the following [paper](https://arxiv.org/abs/2010.12916):

> Katelyn Gao and Ozan Sener. Modeling and Optimization Trade-off in Meta-Learning. *arXiv preprint arXiv:2010.12916* (2020).

```TeX
@misc{GaoSener2020,
  Author = {Katelyn Gao and Ozan Sener},
  Title = {Modeling and Optimization Trade-off in Meta-Learning},
  Year = {2020},
  Eprint = {arXiv:2010.12916},
}
```
