https://github.com/jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
https://github.com/jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

ddpg deep-reinforcement-learning discrete-actions wolpertinger

Last synced: 6 months ago
JSON representation

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

Host: GitHub
URL: https://github.com/jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Owner: jimkon
License: mit
Created: 2017-11-25T14:41:04.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2018-03-01T15:47:02.000Z (over 7 years ago)
Last Synced: 2024-11-04T18:45:20.380Z (11 months ago)
Topics: ddpg, deep-reinforcement-learning, discrete-actions, wolpertinger
Language: Python
Size: 208 MB
Stars: 173
Watchers: 9
Forks: 54
Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Link to [paper](https://arxiv.org/abs/1512.07679)

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym.

This paper introduces Wolpertinger training algorithm that extends the Deep Deterministic Policy Gradient training algorithm introduced in [this](https://arxiv.org/abs/1509.02971) paper.

I used and extended **stevenpjg**'s implementation of **DDPG** algorithm found [here](https://github.com/stevenpjg/ddpg-aigym) licensed under the MIT license.

Master is currently **only for continuous action spaces**.

The branch discrete-and-continuous provides the ability to use the discrete environments of the gym.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Awesome Lists containing this project

README