https://github.com/fkodom/ddpg-her-pytorch

Implementation of the Hindsight Experience Replay (HER) algorithm using PyTorch. Utilizes Deep Deterministic Policy Gradients for off-policy optimization of RL agents in continuous action spaces.
https://github.com/fkodom/ddpg-her-pytorch

Last synced: 3 months ago
JSON representation

Implementation of the Hindsight Experience Replay (HER) algorithm using PyTorch. Utilizes Deep Deterministic Policy Gradients for off-policy optimization of RL agents in continuous action spaces.

Host: GitHub
URL: https://github.com/fkodom/ddpg-her-pytorch
Owner: fkodom
License: gpl-3.0
Created: 2019-07-09T20:59:53.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2019-07-10T01:18:04.000Z (about 6 years ago)
Last Synced: 2025-02-08T09:12:05.519Z (5 months ago)
Language: Python
Size: 2.36 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # DDPG+HER-PyTorch

Implementation of the [Hindsight Experience Replay](https://arxiv.org/abs/1707.01495) algorithm using PyTorch.  Utilizes [Deep Deterministic Policy Gradients](https://arxiv.org/pdf/1509.02971.pdf) for off-policy optimization of the RL agent -- hence, DDPG+HER.  Includes (for now) just one pre-trained example agent (FetchReach-v1), as well as a training script for creating new agents.

### FetchReach-v1 Agent

![DDPG+HER FetchReach-v1 Animation](https://raw.githubusercontent.com/fkodom/ddpg-her-pytorch/master/figures/fetch-reach.gif)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/fkodom/ddpg-her-pytorch

Awesome Lists containing this project

README