Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/awjuliani/meta-rl

Implementation of Meta-RL A3C algorithm
https://github.com/awjuliani/meta-rl

reinforcement-learning tensorflow

Last synced: 6 days ago
JSON representation

Implementation of Meta-RL A3C algorithm

Host: GitHub
URL: https://github.com/awjuliani/meta-rl
Owner: awjuliani
License: mit
Created: 2017-01-28T21:43:45.000Z (about 8 years ago)
Default Branch: master
Last Pushed: 2017-02-22T17:12:45.000Z (almost 8 years ago)
Last Synced: 2025-01-29T15:07:39.191Z (14 days ago)
Topics: reinforcement-learning, tensorflow
Language: Jupyter Notebook
Homepage:
Size: 814 KB
Stars: 402
Watchers: 26
Forks: 109
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Meta-RL
Tensorflow implementation of Meta-RL A3C algorithm taken from [Learning to Reinforcement Learn](https://arxiv.org/abs/1611.05763).
For more information, as well as explainations of each of the experiments, see my corresponding [Medium post](https://medium.com/p/b15b592a2ddf). A3C is built from previous implementation available [here](https://github.com/awjuliani/DeepRL-Agents).

Contains iPython notebooks for:

* **A3C-Meta-Bandit** - Set of bandit tasks described in paper. Including: Independent, Dependent, and Restless bandits.
* **A3C-Meta-Context** - Rainbow bandit task using randomized colors to indicate reward-giving arm in each episode.
* **A3C-Meta-Grid** - Rainbow Gridworld task; a variation of gridworld in which goal colors are randomzied each episode and must be learned "on the fly."