An open API service indexing awesome lists of open source software.

https://github.com/beegass/deep-q-learning

This is my attempt at implementing the paper "Playing Atari with Deep Reinforcement Learning" By Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra and Martin Riedmiller. This is my first attempt at both reading and implementing a research paper.
https://github.com/beegass/deep-q-learning

deep-q-network deep-reinforcement-learning q-learning reinforcement-learning

Last synced: 14 days ago
JSON representation

This is my attempt at implementing the paper "Playing Atari with Deep Reinforcement Learning" By Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra and Martin Riedmiller. This is my first attempt at both reading and implementing a research paper.

Awesome Lists containing this project

README

          

# Deep_Q-Learning

## What Files Should I Look At
* There has been poor use of version control due in part to the newness of google colab to the group.
* To view code associated with the latest iteration in our approach to DQN please refer to [Deep_Q_Network.ipynb](https://github.com/BeeGass/Deep-Q-Learning/blob/main/Deep_Q_Network.ipynb). If you want to view the visualization to portions of DQN please refer to [Manim_For_DQN.ipynb](https://github.com/BeeGass/Deep-Q-Learning/blob/main/Manim_For_DQN.ipynb)

## Resources That Helped
* [Gym Documentation Is Fantastic](https://gym.openai.com/envs/#toy_text)
* [cyoon1729's "vanillaDQN" has some incredibly clear and concise code](https://github.com/cyoon1729/deep-Q-networks)
* [Chris Yoon's, aka cyoon1729, Medium Article On DQN](https://towardsdatascience.com/dqn-part-1-vanilla-deep-q-networks-6eb4a00febfb)
* [Deep Q-Network (DQN)-I](https://towardsdatascience.com/deep-q-network-dqn-i-bce08bdf2af)
* [Deep Q-Network (DQN)-II](https://towardsdatascience.com/deep-q-network-dqn-ii-b6bf911b6b2c)
* [PyTorch's REINFORCEMENT LEARNING (DQN) TUTORIAL](https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html)
* [PyTorch's TRAIN A MARIO-PLAYING RL AGENT](https://pytorch.org/tutorials/intermediate/mario_rl_tutorial.html)
* [The Great David Silver And His Lectures On Reinforcement Learning](https://deepmind.com/learning-resources/-introduction-reinforcement-learning-david-silver)
* [Really wonderful resource that shows side by side explanation of code](https://nn.labml.ai/rl/dqn/index.html)
* [Another Great Tutorial With Code](https://arztsamuel.github.io/en/blogs/2018/Gym-and-Baselines-on-Windows.html)
* [For When You Want To Learn About Epsilon Decay](https://stackoverflow.com/questions/53198503/epsilon-and-learning-rate-decay-in-epsilon-greedy-q-learning)
* [Of Course The Deep Q Network Paper](https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf)

## Thanks
Thank you to
* @edkazcarlson
* @moneill0
* @20hub
* @yichenlilyc
* @dtkatch

for the late night/early morning help

## Clips

https://user-images.githubusercontent.com/44324535/117367069-a6631600-ae8f-11eb-99b9-fd7bb431e311.mp4

https://user-images.githubusercontent.com/44324535/117367113-b7ac2280-ae8f-11eb-9885-b7edc8bb42d0.mp4

https://user-images.githubusercontent.com/44324535/117367122-bbd84000-ae8f-11eb-8981-a8975fc9b7fc.mp4

https://user-images.githubusercontent.com/44324535/117367127-be3a9a00-ae8f-11eb-91e9-d88d73a87eb9.mp4

https://user-images.githubusercontent.com/44324535/117367154-c692d500-ae8f-11eb-876c-51868db1fe6a.mp4

https://user-images.githubusercontent.com/44324535/117367163-c98dc580-ae8f-11eb-9e7e-6c45995cd175.mp4

https://user-images.githubusercontent.com/44324535/117367185-d3afc400-ae8f-11eb-8055-5feabf01f88c.mp4

https://user-images.githubusercontent.com/44324535/117367203-dc07ff00-ae8f-11eb-9d71-29ecac574379.mp4

https://user-images.githubusercontent.com/44324535/117367231-e6c29400-ae8f-11eb-8bfc-ec28f2a0193a.mp4

https://user-images.githubusercontent.com/44324535/117367237-eb874800-ae8f-11eb-8d73-0d34d1df6a7b.mp4

https://user-images.githubusercontent.com/44324535/117367241-ecb87500-ae8f-11eb-8b42-8fb668061ab4.mp4

https://user-images.githubusercontent.com/44324535/117367250-efb36580-ae8f-11eb-918b-8c0e6f253c66.mp4

https://user-images.githubusercontent.com/44324535/117367264-f510b000-ae8f-11eb-9846-b07a7e71e9af.mp4