https://github.com/beegass/deep-q-learning
This is my attempt at implementing the paper "Playing Atari with Deep Reinforcement Learning" by Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. It is my first attempt at both reading and implementing a research paper.
deep-q-network deep-reinforcement-learning q-learning reinforcement-learning
- Host: GitHub
- URL: https://github.com/beegass/deep-q-learning
- Owner: BeeGass
- Created: 2021-04-09T20:53:52.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-05-11T03:33:48.000Z (over 4 years ago)
- Last Synced: 2025-09-09T14:34:10.906Z (26 days ago)
- Topics: deep-q-network, deep-reinforcement-learning, q-learning, reinforcement-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 6.65 MB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Deep_Q-Learning
## What Files Should I Look At
* Version control was used poorly in this repository, in part because Google Colab was new to the group.
* To view the code for the latest iteration of our approach to DQN, please refer to [Deep_Q_Network.ipynb](https://github.com/BeeGass/Deep-Q-Learning/blob/main/Deep_Q_Network.ipynb). To view the visualizations of portions of DQN, please refer to [Manim_For_DQN.ipynb](https://github.com/BeeGass/Deep-Q-Learning/blob/main/Manim_For_DQN.ipynb).

## Resources That Helped
* [Gym Documentation Is Fantastic](https://gym.openai.com/envs/#toy_text)
* [cyoon1729's "vanillaDQN" has some incredibly clear and concise code](https://github.com/cyoon1729/deep-Q-networks)
* [Chris Yoon's (aka cyoon1729) Medium Article On DQN](https://towardsdatascience.com/dqn-part-1-vanilla-deep-q-networks-6eb4a00febfb)
* [Deep Q-Network (DQN)-I](https://towardsdatascience.com/deep-q-network-dqn-i-bce08bdf2af)
* [Deep Q-Network (DQN)-II](https://towardsdatascience.com/deep-q-network-dqn-ii-b6bf911b6b2c)
* [PyTorch's REINFORCEMENT LEARNING (DQN) TUTORIAL](https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html)
* [PyTorch's TRAIN A MARIO-PLAYING RL AGENT](https://pytorch.org/tutorials/intermediate/mario_rl_tutorial.html)
* [The Great David Silver And His Lectures On Reinforcement Learning](https://deepmind.com/learning-resources/-introduction-reinforcement-learning-david-silver)
* [Really wonderful resource that shows side-by-side explanations of code](https://nn.labml.ai/rl/dqn/index.html)
* [Another Great Tutorial With Code](https://arztsamuel.github.io/en/blogs/2018/Gym-and-Baselines-on-Windows.html)
* [For When You Want To Learn About Epsilon Decay](https://stackoverflow.com/questions/53198503/epsilon-and-learning-rate-decay-in-epsilon-greedy-q-learning)
* [Of Course The Deep Q Network Paper](https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf)

## Thanks
Thank you to
* @edkazcarlson
* @moneill0
* @20hub
* @yichenlilyc
* @dtkatch

for the late night/early morning help
## Clips
https://user-images.githubusercontent.com/44324535/117367069-a6631600-ae8f-11eb-99b9-fd7bb431e311.mp4
https://user-images.githubusercontent.com/44324535/117367113-b7ac2280-ae8f-11eb-9885-b7edc8bb42d0.mp4
https://user-images.githubusercontent.com/44324535/117367122-bbd84000-ae8f-11eb-8981-a8975fc9b7fc.mp4
https://user-images.githubusercontent.com/44324535/117367127-be3a9a00-ae8f-11eb-91e9-d88d73a87eb9.mp4
https://user-images.githubusercontent.com/44324535/117367154-c692d500-ae8f-11eb-876c-51868db1fe6a.mp4
https://user-images.githubusercontent.com/44324535/117367163-c98dc580-ae8f-11eb-9e7e-6c45995cd175.mp4
https://user-images.githubusercontent.com/44324535/117367185-d3afc400-ae8f-11eb-8055-5feabf01f88c.mp4
https://user-images.githubusercontent.com/44324535/117367203-dc07ff00-ae8f-11eb-9d71-29ecac574379.mp4
https://user-images.githubusercontent.com/44324535/117367231-e6c29400-ae8f-11eb-8bfc-ec28f2a0193a.mp4
https://user-images.githubusercontent.com/44324535/117367237-eb874800-ae8f-11eb-8d73-0d34d1df6a7b.mp4
https://user-images.githubusercontent.com/44324535/117367241-ecb87500-ae8f-11eb-8b42-8fb668061ab4.mp4
https://user-images.githubusercontent.com/44324535/117367250-efb36580-ae8f-11eb-918b-8c0e6f253c66.mp4
https://user-images.githubusercontent.com/44324535/117367264-f510b000-ae8f-11eb-9846-b07a7e71e9af.mp4