Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/kashif/firedup
Clone of OpenAI's Spinning Up in PyTorch
https://github.com/kashif/firedup
deep-learning pytorch reinforcement-learning spinningup
Last synced: 3 months ago
JSON representation
Clone of OpenAI's Spinning Up in PyTorch
- Host: GitHub
- URL: https://github.com/kashif/firedup
- Owner: kashif
- License: mit
- Created: 2018-11-25T15:46:43.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2022-04-19T10:56:23.000Z (almost 3 years ago)
- Last Synced: 2024-11-12T13:39:07.966Z (3 months ago)
- Topics: deep-learning, pytorch, reinforcement-learning, spinningup
- Language: Python
- Homepage:
- Size: 151 KB
- Stars: 146
- Watchers: 7
- Forks: 25
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Welcome to Fired Up in Deep RL!
This is a clone of OpenAI's [Spinning Up](https://github.com/openai/spinningup) in PyTorch. Spinning Up is an awesome educational resource produced by Josh Achiam, a research scientist at [OpenAI](https://openai.com/), that makes it easier to learn about deep reinforcement learning (deep RL).
## Installation
Fired Up requires Python3, PyTorch, OpenAI Gym, and OpenMPI.
Fired Up is currently only supported on Linux and OSX. It may be possible to install on Windows, though I haven't tested this OS.
### Installing Python
We recommend installing Python through [Anaconda](https://www.anaconda.com/distribution/#download-section). Anaconda is a Python distribution that includes many useful packages especially for scientific computing, as well as an environment manager called `conda` that makes package management simple.
Download and install Anaconda 2018.x (at time of writing, 2018.12) Python 3.7. Then create a `conda` environment for organizing packages used in Fired Up:
```
conda create -n firedup python=3.7
```To use Python from the environment you just created, activate the environment with:
```
source activate firedup
```You can alternatively use [virtualenv](https://virtualenv.pypa.io/en/latest/) with the Python3 version you have. Just install it via `pip3` and then:
```
virtualenv firedup
```To activate this virtual environment you need to:
```
source /path/to/firedup/bin/activate
```### Installing OpenMPI
#### Ubuntu
```
sudo apt update && sudo apt install libopenmpi-dev
```#### Mac OS X
Installation of system packages on Mac requires [Homebrew](https://brew.sh). With Homebrew installed, run the following:
```
brew install openmpi
```### Installing Fired Up
```
git clone https://github.com/kashif/firedup.git
cd firedup
pip install -e .
```Fired Up defaults to installing everything in Gym **except** the MuJoCo environments.
### Check Your Install
To see if you've successfully installed Fired Up, try running PPO in the `LunarLander-v2` environment with:
```
python -m fireup.run ppo --hid "[32,32]" --env LunarLander-v2 --exp_name installtest --gamma 0.999
```After it finishes training, watch a video of the trained policy with:
```
python -m fireup.run test_policy data/installtest/installtest_s0
```And plot the results with:
```
python -m fireup.run plot data/installtest/installtest_s0
```## Algorithms
The following algorithms are implemented in the Fired Up package:
* Vanilla Policy Gradient (VPG)
* Trust Region Policy Optimization (TRPO)
* Proximal Policy Optimization (PPO)
* Deep Q-Network (DQN)
* Deep Deterministic Policy Gradient (DDPG)
* Twin Delayed DDPG (TD3)
* Soft Actor-Critic (SAC)They are all implemented with MLP (non-recurrent) actor-critics, making them suitable for fully-observed, non-image-based RL environments, e.g. the Gym Mujoco environments.
## Citation
If you use Fired Up in your research please use the following BibTeX entry:
```BibTeX
@misc{rasulfiredup,
author = {Kashif Rasul and Joshua Achiam},
title = {Fired Up},
howpublished = {\url{https://github.com/kashif/firedup/}},
year = {2019}
}
```