https://github.com/pythonlessons/reinforcement_learning

Reinforcement learning tutorials

# Reinforcement Learning Tutorials:

*2020-10-07 added support for TensorFlow 2.3.1*

PPO and PPO_CNN agents playing the Pong-v0 game:

![PPO agent](11_Pong-v0_PPO/gameplay.gif)
![PPO CNN agent](11_Pong-v0_PPO/gameplay_CNN.gif)

*2020-10-10 added LunarLander-v2_PPO Continuous code for TensorFlow 2.3.1*:
![LunarLander-v2_PPO](https://github.com/pythonlessons/Reinforcement_Learning/blob/master/LunarLander-v2_PPO/gameplay.gif)

*2020-10-23 added BipedalWalker-v3_PPO code for TensorFlow 2.3.1*:
![BipedalWalker-v3_PPO](https://github.com/pythonlessons/Reinforcement_Learning/blob/master/BipedalWalker-v3_PPO/gameplay.gif)

1. [Deep Q Learning tutorial (DQN)](https://pylessons.com/CartPole-reinforcement-learning/)

2. [Double Deep Q Learning tutorial (DDQN)](https://pylessons.com/CartPole-DDQN/)

3. [Dueling Double Deep Q Learning tutorial (D3QN)](https://pylessons.com/CartPole-DDDQN/)

4. [Epsilon Greedy Dueling Double Deep Q Learning tutorial (D3QN)](https://pylessons.com/Epsilon-Greedy-DQN/)

5. [Prioritized Experience Replay (PER) D3QN tutorial](https://pylessons.com/CartPole-PER/)

6. [D3QN PER with Convolutional Neural Networks tutorial](https://pylessons.com/CartPole-PER-CNN/)

7. [A.I. learns to play Pong with DQN](https://pylessons.com/DQN-PONG/)

8. [Introduction to RL Policy Gradient (PG or REINFORCE)](https://pylessons.com/Beyond-DQN/)

9. [Introduction to RL Advantage Actor Critic algorithm (A2C)](https://pylessons.com/A2C-reinforcement-learning/)

10. [Introduction to RL Asynchronous Advantage Actor Critic algorithm (A3C)](https://pylessons.com/A3C-reinforcement-learning/)

11. [Introduction to RL Proximal Policy Optimization algorithm (PPO)](https://pylessons.com/PPO-reinforcement-learning/)

12. [Let’s code from scratch a discrete Reinforcement Learning rocket landing agent! (PPO)](https://pylessons.com/LunarLander-v2-PPO/)

13. [Continuous Proximal Policy Optimization Tutorial with OpenAI gym environment! (PPO)](https://pylessons.com/BipedalWalker-v3-PPO/)



PPO Pong-v0 Learning curve: