An open API service indexing awesome lists of open source software.

https://github.com/agentmaker/paddle-rlbooks

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
https://github.com/agentmaker/paddle-rlbooks

actor-critic c51 ddpg double-dqn dqn dueling-dqn noisy-dqn nstep-dqn paddlepaddle policy-gradient policy-gradient-with-baseline policy-iteration q-learning reinforce reinforcement-learning sac sarsa td3 value-iteration

Last synced: 9 months ago
JSON representation

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

Awesome Lists containing this project

README

          

# Paddle-RLBooks

Welcome to Paddle-RLBooks which is a reinforcement learning code study guide based on pure PaddlePaddle.

欢迎来到Paddle-RLBooks,该仓库主要是针对强化学习中的一些算法进行整理,包括DQN、TD3、SAC等算法,并且每个都配备了游戏可直接一键运行,欢迎star~

## Show
![](./material/flappybird.jpg)

## Codes
- Release
- [x] [Policy Iteration](./policy_iteration)
- [x] [Value Iteration](./value_iteration)
- [x] [Sarsa](./sarsa)
- [x] [Q-learning](./qlearning)
- [x] [DQN](./dqn)
- [x] [DQN](./dqn/dqn)
- [x] [Nstep-DQN](./dqn/nstep_dqn)
- [x] [Double-DQN](./dqn/double_dqn)
- [x] [Dueling-DQN](./dqn/dueling_dqn)
- [x] [Noisy-DQN](./dqn/dqn_noisy_networks)
- [x] [C51](./dqn/categorical_dqn(C51))
- [x] [Policy Gradient](./policy_gradient)
- [x] [Policy Gradient Basic](./policy_gradient)
- [x] [Reinforce](./policy_gradient)
- [x] [Policy Gradient With Baseline](./policy_gradient)
- [x] [Actor-Critic](./actor_critic)
- [x] [DDPG](./ddpg)
- [x] [TD3](./td3)
- [x] [SAC](./sac)
- Coming Soon
- [ ] TRPO
- [ ] ACKTR
- [ ] A2C
- [ ] A3C
- [ ] PPO
- [ ] DRQN
- [ ] QMIX
- [ ] MAPPDG
- [ ] MFMARL

## Contact us
Email : [agentmaker@163.com]()

QQ Group : 1005109853