https://github.com/fkodom/ddpg-her-pytorch

Implementation of the Hindsight Experience Replay (HER) algorithm using PyTorch. Utilizes Deep Deterministic Policy Gradients for off-policy optimization of RL agents in continuous action spaces.

# DDPG+HER-PyTorch

Implementation of the [Hindsight Experience Replay](https://arxiv.org/abs/1707.01495) (HER) algorithm using PyTorch. It uses [Deep Deterministic Policy Gradients](https://arxiv.org/pdf/1509.02971.pdf) (DDPG) for off-policy optimization of the RL agent, hence DDPG+HER. For now, the repository includes just one pre-trained example agent (FetchReach-v1), along with a training script for creating new agents.
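To illustrate the core idea behind HER, here is a minimal sketch of goal relabeling using the "future" strategy described in the HER paper. It is not taken from this repository's code: the function names `sparse_reward` and `her_relabel`, the transition dictionary keys, the tolerance `tol=0.05`, and the default `k=4` are all assumptions made for this example.

```python
import numpy as np


def sparse_reward(achieved_goal, desired_goal, tol=0.05):
    """Sparse goal-reaching reward: 0 on success, -1 otherwise.

    `tol` is an assumed distance threshold, not a value read from the repo.
    """
    dist = np.linalg.norm(np.asarray(achieved_goal) - np.asarray(desired_goal))
    return 0.0 if dist <= tol else -1.0


def her_relabel(episode, k=4, rng=None):
    """Return the original transitions plus k relabeled copies per step.

    `episode` is a list of dicts with (hypothetical) keys:
    "obs", "action", "next_obs", "achieved_goal", "desired_goal".
    Each relabeled copy swaps the desired goal for a goal that was
    actually achieved at the same or a later step of the same episode.
    """
    rng = np.random.default_rng() if rng is None else rng
    num_steps = len(episode)
    transitions = []
    for t, step in enumerate(episode):
        # Keep the original transition with its original goal.
        reward = sparse_reward(step["achieved_goal"], step["desired_goal"])
        transitions.append({**step, "reward": reward})
        for _ in range(k):
            # Sample a step index in [t, num_steps - 1].
            future = rng.integers(t, num_steps)
            new_goal = episode[future]["achieved_goal"]
            # Recompute the reward as if new_goal had been the target all along.
            new_reward = sparse_reward(step["achieved_goal"], new_goal)
            transitions.append(
                {**step, "desired_goal": new_goal, "reward": new_reward}
            )
    return transitions
```

With `k=4`, every step of an episode produces five transitions for the replay buffer: the original and four relabeled ones. Because relabeled goals were actually reached, some of these transitions carry a success reward even when the episode never hit its original target, which gives an off-policy learner such as DDPG a useful signal under sparse rewards.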

### FetchReach-v1 Agent

![DDPG+HER FetchReach-v1 Animation](https://raw.githubusercontent.com/fkodom/ddpg-her-pytorch/master/figures/fetch-reach.gif)