Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/pythonlessons/reinforcement_learning

Reinforcement learning tutorials
https://github.com/pythonlessons/reinforcement_learning

a2c a3c actor-critic-algorythm bipedalwalker d3qn ddqn dqn dueling-dqn lunarlander policy-gradient ppo ppo-agent reinforcement-learning

Last synced: 7 days ago
JSON representation

Reinforcement learning tutorials

Host: GitHub
URL: https://github.com/pythonlessons/reinforcement_learning
Owner: pythonlessons
License: mit
Created: 2020-01-13T14:27:41.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2023-03-25T01:33:39.000Z (over 1 year ago)
Last Synced: 2024-12-07T02:33:47.150Z (15 days ago)
Topics: a2c, a3c, actor-critic-algorythm, bipedalwalker, d3qn, ddqn, dqn, dueling-dqn, lunarlander, policy-gradient, ppo, ppo-agent, reinforcement-learning
Language: Python
Homepage: https://pylessons.com/
Size: 87 MB
Stars: 362
Watchers: 7
Forks: 154
Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE.md

Awesome Lists containing this project

README

        # Reinforcement Learning Tutorials:

*2020-10-07 added support for Tensorflow 2.3.1*

PPO and PPO_CNN agents playing Pong-v0 game:


![PPO agent](11_Pong-v0_PPO/gameplay.gif)

![PPO CNN agent](11_Pong-v0_PPO/gameplay_CNN.gif)

*2020-10-10 added LunarLander-v2_PPO Continuous code for Tensorflow 2.3.1*:

![LunarLander-v2_PPO](https://github.com/pythonlessons/Reinforcement_Learning/blob/master/LunarLander-v2_PPO/gameplay.gif)

*2020-10-23 added BipedalWalker-v3_PPO code for Tensorflow 2.3.1*:

![BipedalWalker-v3_PPO_PPO](https://github.com/pythonlessons/Reinforcement_Learning/blob/master/BipedalWalker-v3_PPO/gameplay.gif)

1. [Deep Q Learning tutorial (DQN)](https://pylessons.com/CartPole-reinforcement-learning/)

2. [Double Deep Q Learning tutorial (DDQN)](https://pylessons.com/CartPole-DDQN/)

3. [Dueling Double Deep Q Learning tutorial (D3QN)](https://pylessons.com/CartPole-DDDQN/)

4. [Epsilon Greedy Dueling Double Deep Q Learning tutorial (D3QN)](https://pylessons.com/Epsilon-Greedy-DQN/)

5. [Prioritized Experience Replay (PER) D3QN tutorial](https://pylessons.com/CartPole-PER/)

6. [D3QN PER with Convolutional Neural Networks tutorial](https://pylessons.com/CartPole-PER-CNN/)

7. [A.I. learns to play Pong with DQN](https://pylessons.com/DQN-PONG/)

8. [Introduction to RL Policy Gradient (PG or REINFORCE)](https://pylessons.com/Beyond-DQN/)

9. [Introduction to RL Advanced Actor Critic algorythm (A2C)](https://pylessons.com/A2C-reinforcement-learning/)

10. [Introduction to RL Asynchronous Advanced Actor Critic algorythm (A3C)](https://pylessons.com/A3C-reinforcement-learning/)

11. [Introduction to RL Proximal Policy Optimization algorythm (PPO)](https://pylessons.com/PPO-reinforcement-learning/)

12. [Let’s code from scratch a discrete Reinforcement Learning rocket landing agent! (PPO)](https://pylessons.com/LunarLander-v2-PPO/)

13. [Continuous Proximal Policy Optimization Tutorial with OpenAI gym environment! (PPO)](https://pylessons.com/BipedalWalker-v3-PPO/)





PPO Pong-v0 Learning curve: