https://github.com/allenpandas/tutorial4rl

Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.
https://github.com/allenpandas/tutorial4rl

a3c ddpg deep-reinforcement-learning dqn inverse-reinforcement-learning multi-agent-reinforcement-learning multi-agent-systems policy-gradient qlearning reinforcement-learning reinforcementlearning-tutorial rl-tutorial sarsa tutorial

Last synced: 3 months ago
JSON representation

Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.

Host: GitHub
URL: https://github.com/allenpandas/tutorial4rl
Owner: Allenpandas
License: mit
Created: 2023-01-07T07:52:38.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-03-27T01:34:54.000Z (about 1 year ago)
Last Synced: 2025-01-24T02:19:44.124Z (4 months ago)
Topics: a3c, ddpg, deep-reinforcement-learning, dqn, inverse-reinforcement-learning, multi-agent-reinforcement-learning, multi-agent-systems, policy-gradient, qlearning, reinforcement-learning, reinforcementlearning-tutorial, rl-tutorial, sarsa, tutorial
Homepage:
Size: 4.17 MB
Stars: 128
Watchers: 6
Forks: 12
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # Tutorial4RL

Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.

## Related Repository

| Repository                                                   | Remark                                                       |

| ------------------------------------------------------------ | ------------------------------------------------------------ |

| [Awesome-Reinforcement-Learning-Papers](https://github.com/Allenpandas/Awesome-Reinforcement-Learning-Papers) |    |

| [Tutorial4RL](https://github.com/Allenpandas/Tutorial4RL)    |    |

| [2023-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2023-Reinforcement-Learning-Conferences-Papers) |    |

| [2022-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2022-Reinforcement-Learning-Conferences-Papers) |    |

| [2021-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2021-Reinforcement-Learning-Conferences-Papers) |    |

| [2020-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2020-Reinforcement-Learning-Conferences-Papers) |    |

| [2019-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2019-Reinforcement-Learning-Conferences-Papers) |    |

| [2018-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2018-Reinforcement-Learning-Conferences-Papers) |    |

| [2017-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2017-Reinforcement-Learning-Conferences-Papers) |    |

## Open Source Projects

- PFRL：基于Pytorch的深度强化学习库： [https://github.com/pfnet/pfrl](https://github.com/pfnet/pfrl)

- 莫烦强化学习TensorFlow代码： [https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow)

- 百度飞桨PaddlePaddle强化学习代码： [https://github.com/PaddlePaddle/PARL](https://github.com/PaddlePaddle/PARL)

- Github强大的强化学习库： [https://github.com/wwxFromTju/awesome-reinforcement-learning-lib](https://github.com/wwxFromTju/awesome-reinforcement-learning-lib)

- 优达学城（在线教育平台）强化学习库： [https://github.com/udacity/deep-reinforcement-learning](https://github.com/udacity/deep-reinforcement-learning)

## Books & Videos

- 《深度强化学习》王树森： [https://www.bilibili.com/video/BV12o4y197US](https://www.bilibili.com/video/BV12o4y197US)

- 《Deep Reinforcement Learning》李宏毅： [https://www.bilibili.com/video/BV1UE411G78S](https://www.bilibili.com/video/BV1UE411G78S)

- 《世界冠军带你从零实践强化学习》百度飞桨团队： [https://www.bilibili.com/video/BV1yv411i7xd](https://www.bilibili.com/video/BV1yv411i7xd)

- 《强化学习白板推导》：[https://space.bilibili.com/97068901/channel/seriesdetail?sid=594040](https://space.bilibili.com/97068901/channel/seriesdetail?sid=594040)

- 《蘑菇书EasyRL》王琦等： [https://github.com/datawhalechina/easy-rl](https://github.com/datawhalechina/easy-rl)

- 《动手学强化学习》张伟楠等： [http://hrl.boyuai.com/](http://hrl.boyuai.com/)

## Relevant Conferences

| Abbr.   | Full Name                                                    | CCF Rank |

| ------- | ------------------------------------------------------------ | :------: |

| ICML    | International Conference on Machine Learning                 |  CCF-A   |

| NeurIPS | Annual Conference on Neural Information Processing Systems   |  CCF-A   |

| ICLR    | International Conference on Learning Representations         |    —     |

| AAAI    | AAAI Conference on Artificial Intelligence                   |  CCF-A   |

| IJCAI   | International Joint Conference on Artificial Intelligence    |  CCF-A   |

| AAMAS   | International Joint Conference on Autonomous Agents and Multi-agent Systems |  CCF-B   |

| ICRA    | IEEE International Conference on Robotics and Automation     |  CCF-B   |

## Community

- RLChina强化学习社区： [http://rlchina.org/](http://rlchina.org/)

- 智源社区强化学习专栏： [https://hub.baai.ac.cn/?tag_id=74](https://hub.baai.ac.cn/?tag_id=74)

- 智源社区强化学习周刊： [https://hub.baai.ac.cn/users/18447](https://hub.baai.ac.cn/users/18447)

## Langya Rank

### Domestic Langya Rank

| Name   | Organization       | Link                                                      | Focus                                    |

| ------ | ------------------ | --------------------------------------------------------- | ---------------------------------------- |

| 郝建业 | 天津大学           | [[HomePage](http://www.icdai.org/jianye.html)]            | 多智能体强化学习、博弈论                 |

| 张海峰 | 中科院自动化所     | [[HomePage](https://people.ucas.edu.cn/~zhf)]             | 多智能体强化学习、智能体博弈、智能体评估 |

| 罗军   | 华为诺亚方舟实验室 | [[HomePage](https://openreview.net/profile?id=~Jun_Luo1)] | 自动驾驶、强化学习                       |

| 王祥丰 | 华东师范大学       | [[HomePage](https://mail-ecnu.cn/people/xfwang)]          | 多智体强化学习                           |

| 俞扬   | 南京大学           | [[HomePage](https://www.yuque.com/eyounx/home)]           | 强化学习、离线强化学习                   |

| 杨耀东 | 北京大学           | [[HomePage](https://www.yangyaodong.com/)]                | 多智能体强化学习、博弈论                 |

| 卢宗青 | 北京大学           | [[HomePage](https://z0ngqing.github.io/)]                 | 强化学习                                 |

| 张崇洁 | 清华大学           | [[HomePage](http://people.iiis.tsinghua.edu.cn/~zhang/)]  | 深度强化学习、多智能体                   |

### Abroad Langya Rank

| Name 
| -------------------- 
| Sergey Levine 
| Piter Abbeel 
| Matthew E. Taylor 
| Peter Stone 
| Shimon Whiteson 
| Jan Peters 
| Shie Mannor 
| Chelsea Finn 
| Dusit Niyato 
| Doina Precup 
| Ann Nowé 
| Marcello Restelli 
| Frank L. Lewis 
| H. Vincent Poor 
| Vaneet Aggarwal 
| F. Richard Yu 
| Jun Wang 
| Michael L. Littman   | 
| Satinder Singh 
| Mehdi Bennis 
| David Silver 
| Rémi Munos 
| Marc G. Bellemare    | 
| Joelle Pineau 
| Martin 
| Mohsen Guizani 
| Stefan Wermter 
| Ying-Chang Liang     | 
| Jonathan P. How 
| Ivana Dusparic 
| Robert Babuska 
| Emma Brunskill 
| Bo An

| Organization                                                 |                             Link                             | | ------------------------------------------------------------ | :----------------------------------------------------------: | | UC Berkeley                                                  | [[Google Scholar](https://scholar.google.com/citations?user=8R35rCwAAAAJ&hl=zh-CN&oi=ao)] | | UC Berkeley                                                  | [[Google Scholar](https://scholar.google.com/citations?user=vtwH6GkAAAAJ&hl=zh-CN)] | | University of Alberta                                        | [[Google Scholar](https://scholar.google.com/citations?user=edQgLXcAAAAJ&hl=zh-CN&oi=ao)] | | University of Texas at Austin                                | [[Google Scholar](https://scholar.google.com/citations?user=qnwjcfAAAAAJ&hl=zh-CN&oi=ao)] | | University of Oxford / Waymo                                 | [[Google Scholar](https://scholar.google.com/citations?user=9zeEI-cAAAAJ&hl=zh-CN&oi=ao)] | | German AI Research Center                                    | [[Google Scholar](https://scholar.google.com/citations?user=-kIVAcAAAAAJ&hl=zh-CN&oi=ao)] | | Nvidia                                                       | [[Google Scholar](https://scholar.google.com/citations?user=q1HlbIUAAAAJ&hl=zh-CN&oi=ao)] | | Stanford University / Google                                 | [[Google Scholar](https://scholar.google.com/citations?user=vfPE6hgAAAAJ&hl=zh-CN&oi=ao)] | |                                                              |                       [Google Scholar]                       | | DeepMind / McGill University                                 | [[Google Scholar](https://scholar.google.com/citations?user=j54VcVEAAAAJ&hl=zh-CN&oi=ao)] | |                                                              | [[Google Scholar](https://scholar.google.com/citations?user=LH5QKbgAAAAJ&hl=zh-CN&oi=ao)] | | Politecnico di Milano                                        | [[Google Scholar](https://scholar.google.com/citations?user=xdgxRiEAAAAJ&hl=zh-CN&oi=ao)] | |                                                              |                       [Google Scholar]                       | |                                                              |                       [Google Scholar]                       | | Purdue University                                            | [[Google Scholar](https://scholar.google.com/citations?user=Tu4lmGwAAAAJ&hl=zh-CN&oi=ao)] | | Carleton University                                          | [[Google Scholar](https://scholar.google.com/citations?user=zuGMGBoAAAAJ&hl=zh-CN&oi=ao)] | | University College London                                    | [[Google Scholar](https://scholar.google.com/citations?hl=zh-CN&user=wIE1tY4AAAAJ)] | |                       [Google Scholar]                       | | University of Michigan                                       |                       [Google Scholar]                       | |                                                              |                       [Google Scholar]                       | | University College London / DeepMind                         |                       [Google Scholar]                       | |                                                              |                       [Google Scholar]                       | |                       [Google Scholar]                       | | McGill University / Meta AI                                  | [[Google Scholar](https://scholar.google.com/citations?user=CEt6_mMAAAAJ&hl=zh-CN&oi=ao)] | A. Riedmiller | Google                                                       | [[Google Scholar](https://scholar.google.com/citations?hl=zh-CN&user=1gVfqpcAAAAJ&view_op=list_works&sortby=pubdate)] | | Mohamed Bin Zayed University of Artificial Intelligence      | [[Google Scholar]](https://scholar.google.com/citations?hl=zh-CN&user=RigrYkcAAAAJ&view_op=list_works&sortby=pubdate) | | University of Hamburg                                        | [[Google Scholar](https://scholar.google.com/citations?user=uIeaxuAAAAAJ&hl=zh-CN&oi=ao)] | | [[Google Scholar](https://scholar.google.com/citations?user=HybIiJ8AAAAJ&hl=zh-CN&oi=ao)] | |                                                              |                       [Google Scholar]                       | | Trinity College Dublin                                       | [[Google Scholar](https://scholar.google.com/citations?user=CrGGAccAAAAJ&hl=zh-CN&oi=ao)] | | Delft University of Technology / Czech Technical University Prague | [[Google Scholar](https://scholar.google.com/citations?user=0orN2FUAAAAJ&hl=zh-CN&oi=ao)] | | Stanford University                                          | [[Google Scholar](https://scholar.google.com/citations?user=HaN8b2YAAAAJ&hl=zh-CN&oi=ao)] | | Nanyang Technological University                             | [[Google Scholar](https://scholar.google.com/citations?user=PEEpuNwAAAAJ&hl=zh-CN&oi=ao)] |

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/allenpandas/tutorial4rl

Awesome Lists containing this project

README