# Human-Level Control through Deep Reinforcement Learning

Tensorflow implementation of [Human-Level Control through Deep Reinforcement Learning](http://home.uchicago.edu/~arij/journalclub/papers/2015_Mnih_et_al.pdf).

![model](assets/model.png)

This implementation contains:

1. Deep Q-network and Q-learning
2. Experience replay memory (see the sketch below)
   - to reduce the correlations between consecutive updates
3. A target network for the Q-learning targets that is held fixed over intervals
   - to reduce the correlations between target and predicted Q-values
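The replay memory and the periodically fixed target network are the two stabilization ideas from the paper. As a rough illustration only (not the actual classes in this repository), a uniform-sampling replay buffer might look like this:

```python
import random
from collections import deque

import numpy as np

class ReplayBuffer(object):
    """Fixed-capacity store of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity=1000000):
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size=32):
        # Uniform random sampling breaks the correlation between consecutive updates.
        batch = random.sample(self.buffer, batch_size)
        states, actions, rewards, next_states, dones = map(np.array, zip(*batch))
        return states, actions, rewards, next_states, dones
```

Training then draws minibatches from such a buffer instead of learning from transitions in order, and the target network's weights are copied from the online network only every fixed number of steps.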

## Requirements

- Python 2.7 or Python 3.3+
- [gym](https://github.com/openai/gym)
- [tqdm](https://github.com/tqdm/tqdm)
- [SciPy](http://www.scipy.org/install.html) or [OpenCV2](http://opencv.org/)
- [TensorFlow 0.12.0](https://github.com/tensorflow/tensorflow/tree/r0.12)

## Usage

First, install prerequisites with:

```bash
$ pip install tqdm gym[all]
```

To train a model for Breakout:

```bash
$ python main.py --env_name=Breakout-v0 --is_train=True
$ python main.py --env_name=Breakout-v0 --is_train=True --display=True
```

To test and record the screen with gym:

```bash
$ python main.py --is_train=False
$ python main.py --is_train=False --display=True
```

## Results

Result of training for 24 hours on a GTX 980 Ti.

![best](assets/best.gif)

## Simple Results

Details of `Breakout` with model `m2` (red) trained for 30 hours on a GTX 980 Ti.

![tensorboard](assets/0620_scalar_step_m2.png)

Details of `Breakout` with model `m3` (red) trained for 30 hours on a GTX 980 Ti.

![tensorboard](assets/0620_scalar_step_m3.png)

## Detailed Results

**[1] Action-repeat (frame-skip) of 1, 2, and 4 without learning rate decay**

![A1_A2_A4_0.00025lr](assets/A1_A2_A4_0.00025lr.png)
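Action-repeat (frame-skip) means each chosen action is simply repeated for several emulator frames before the agent picks again. As a hedged sketch only (assuming the classic four-value `env.step` API of the gym version this repository targets, not code from this codebase):

```python
def step_with_repeat(env, action, repeat=4):
    """Repeat one action `repeat` times, summing the reward over the skipped frames."""
    total_reward = 0.0
    for _ in range(repeat):
        obs, reward, done, info = env.step(action)
        total_reward += reward
        if done:
            break
    return obs, total_reward, done, info
```

A repeat of 1 processes every frame; a repeat of 4 matches the setting used in the original DQN paper.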

**[2] Action-repeat (frame-skip) of 1, 2, and 4 with learning rate decay**

![A1_A2_A4_0.0025lr](assets/A1_A2_A4_0.0025lr.png)

**[1] & [2]**

![A1_A2_A4_0.00025lr_0.0025lr](assets/A1_A2_A4_0.00025lr_0.0025lr.png)
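For the runs "with learning rate decay", such a schedule can be expressed with TensorFlow 0.12's `tf.train.exponential_decay`. The numbers below are illustrative assumptions only, not the settings behind these plots:

```python
import tensorflow as tf

global_step = tf.Variable(0, trainable=False)
# Illustrative schedule: multiply the learning rate by 0.96 every 50,000 steps.
learning_rate = tf.train.exponential_decay(
    learning_rate=0.00025,
    global_step=global_step,
    decay_steps=50000,
    decay_rate=0.96,
    staircase=True)
optimizer = tf.train.RMSPropOptimizer(learning_rate, momentum=0.95, epsilon=0.01)
```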

**[3] Action-repeat of 4 for DQN (dark blue), Dueling DQN (dark green), DDQN (brown), and Dueling DDQN (turquoise)**

The current hyperparameters and gradient clipping are not implemented exactly as described in the paper.

![A4_duel_double](assets/A4_duel_double.png)
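The difference between DQN and DDQN lies only in how the target value is computed: DDQN selects the next action with the online network but evaluates it with the target network. A hedged NumPy sketch of that target (a hypothetical helper for illustration, not this repository's code):

```python
import numpy as np

def double_dqn_targets(rewards, dones, next_q_online, next_q_target, gamma=0.99):
    """Double DQN target: the online net picks the next action, the target net scores it."""
    best_actions = np.argmax(next_q_online, axis=1)
    chosen_q = next_q_target[np.arange(len(best_actions)), best_actions]
    return rewards + (1.0 - dones) * gamma * chosen_q
```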

**[4] Distributed action-repeat (frame-skip) of 1 without learning rate decay**

![A1_0.00025lr_distributed](assets/A4_0.00025lr_distributed.png)

**[5] Distributed action-repeat (frame-skip) of 4 without learning rate decay**

![A4_0.00025lr_distributed](assets/A4_0.00025lr_distributed.png)

## References

- [simple_dqn](https://github.com/tambetm/simple_dqn.git)
- [Code for Human-level control through deep reinforcement learning](https://sites.google.com/a/deepmind.com/dqn/)

## License

MIT License.