https://github.com/alro10/rl_pong

This is a repository for project in MO810 course-1s2018 IC-UNICAMP. The project is about implement DQN, ES and policy gradients for Pong and Catch game.
https://github.com/alro10/rl_pong

catch deep-reinforcement-learning dqn google-colab google-colaboratory pong-game

Last synced: 7 months ago
JSON representation

This is a repository for project in MO810 course-1s2018 IC-UNICAMP. The project is about implement DQN, ES and policy gradients for Pong and Catch game.

Host: GitHub
URL: https://github.com/alro10/rl_pong
Owner: Alro10
License: mit
Created: 2018-06-12T18:55:22.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2018-06-19T20:44:09.000Z (over 7 years ago)
Last Synced: 2025-01-20T13:27:28.287Z (9 months ago)
Topics: catch, deep-reinforcement-learning, dqn, google-colab, google-colaboratory, pong-game
Language: Jupyter Notebook
Size: 183 KB
Stars: 0
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# DQN-NES-Pong
This is a repository for project in MO810 course-1s2018 IC-UNICAMP. The project is about implement DQN, NES and policy gradients for Pong and Catch game.

## Requirements

Python 3.5, PyTorch >= 0.2.0, numpy, gym, universe, cv2.

## Deep Q-learning (DQN):

* `dqn_pong.ipynb` : This is a DQN implementation for Pong game (gym environment) and was trained in google colab (aprox 5 hours). Achieved **reward = 18**.

* `kerasdqn_catch.ipynb`: For learn more about DQN, I decided to implement a shallow neural network for catch enviroment. See the results in file.

## Evolution Strategies (Natural):

* `main.py` : Train ES on Pong and achieved **reward = 5** after 72 hours of training. Functions from `train.py`, `envs.py` , `model.py`

```
python3 main.py --env-name Pong-v4 --n 10 --lr 0.01 --useAdam
```

* `catch_ES.ipynb`: NES implementation for catch game! See results in flie.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/alro10/rl_pong

Awesome Lists containing this project

README