https://github.com/allenpandas/tutorial4rl
Tutorial4RL: Tutorial for Reinforcement Learning. An introductory tutorial on reinforcement learning.
a3c ddpg deep-reinforcement-learning dqn inverse-reinforcement-learning multi-agent-reinforcement-learning multi-agent-systems policy-gradient qlearning reinforcement-learning reinforcementlearning-tutorial rl-tutorial sarsa tutorial
- Host: GitHub
- URL: https://github.com/allenpandas/tutorial4rl
- Owner: Allenpandas
- License: mit
- Created: 2023-01-07T07:52:38.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-27T01:34:54.000Z (about 1 year ago)
- Last Synced: 2025-01-24T02:19:44.124Z (4 months ago)
- Topics: a3c, ddpg, deep-reinforcement-learning, dqn, inverse-reinforcement-learning, multi-agent-reinforcement-learning, multi-agent-systems, policy-gradient, qlearning, reinforcement-learning, reinforcementlearning-tutorial, rl-tutorial, sarsa, tutorial
- Homepage:
- Size: 4.17 MB
- Stars: 128
- Watchers: 6
- Forks: 12
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Tutorial4RL
Tutorial4RL: Tutorial for Reinforcement Learning. An introductory tutorial on reinforcement learning.

## Related Repository
| Repository | Remark |
| ------------------------------------------------------------ | ------------------------------------------------------------ |
| [Awesome-Reinforcement-Learning-Papers](https://github.com/Allenpandas/Awesome-Reinforcement-Learning-Papers) | |
| [Tutorial4RL](https://github.com/Allenpandas/Tutorial4RL) | |
| [2023-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2023-Reinforcement-Learning-Conferences-Papers) | |
| [2022-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2022-Reinforcement-Learning-Conferences-Papers) | |
| [2021-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2021-Reinforcement-Learning-Conferences-Papers) | |
| [2020-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2020-Reinforcement-Learning-Conferences-Papers) | |
| [2019-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2019-Reinforcement-Learning-Conferences-Papers) | |
| [2018-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2018-Reinforcement-Learning-Conferences-Papers) | |
| [2017-Reinforcement-Learning-Conferences-Papers](https://github.com/Allenpandas/2017-Reinforcement-Learning-Conferences-Papers) | |

## Open Source Projects
- PFRL, a PyTorch-based deep reinforcement learning library: [https://github.com/pfnet/pfrl](https://github.com/pfnet/pfrl)
- Morvan Zhou's (莫烦) reinforcement learning code in TensorFlow: [https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow](https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow)
- PARL, Baidu PaddlePaddle's reinforcement learning code: [https://github.com/PaddlePaddle/PARL](https://github.com/PaddlePaddle/PARL)
- A curated list of powerful reinforcement learning libraries on GitHub: [https://github.com/wwxFromTju/awesome-reinforcement-learning-lib](https://github.com/wwxFromTju/awesome-reinforcement-learning-lib)
- Udacity (online education platform) deep reinforcement learning repository: [https://github.com/udacity/deep-reinforcement-learning](https://github.com/udacity/deep-reinforcement-learning)

## Books & Videos
- *Deep Reinforcement Learning* (《深度强化学习》), Wang Shusen (王树森): [https://www.bilibili.com/video/BV12o4y197US](https://www.bilibili.com/video/BV12o4y197US)
- *Deep Reinforcement Learning*, Hung-yi Lee (李宏毅): [https://www.bilibili.com/video/BV1UE411G78S](https://www.bilibili.com/video/BV1UE411G78S)
- *Practice Reinforcement Learning from Scratch with a World Champion* (《世界冠军带你从零实践强化学习》), Baidu PaddlePaddle team: [https://www.bilibili.com/video/BV1yv411i7xd](https://www.bilibili.com/video/BV1yv411i7xd)
- *Whiteboard Derivations of Reinforcement Learning* (《强化学习白板推导》): [https://space.bilibili.com/97068901/channel/seriesdetail?sid=594040](https://space.bilibili.com/97068901/channel/seriesdetail?sid=594040)
- *EasyRL*, the "Mushroom Book" (《蘑菇书EasyRL》), Wang Qi (王琦) et al.: [https://github.com/datawhalechina/easy-rl](https://github.com/datawhalechina/easy-rl)
- *Hands-on Reinforcement Learning* (《动手学强化学习》), Zhang Weinan (张伟楠) et al.: [http://hrl.boyuai.com/](http://hrl.boyuai.com/)

## Relevant Conferences
| Abbr. | Full Name | CCF Rank |
| ------- | ------------------------------------------------------------ | :------: |
| ICML | International Conference on Machine Learning | CCF-A |
| NeurIPS | Annual Conference on Neural Information Processing Systems | CCF-A |
| ICLR | International Conference on Learning Representations | — |
| AAAI | AAAI Conference on Artificial Intelligence | CCF-A |
| IJCAI | International Joint Conference on Artificial Intelligence | CCF-A |
| AAMAS | International Joint Conference on Autonomous Agents and Multi-agent Systems | CCF-B |
| ICRA | IEEE International Conference on Robotics and Automation | CCF-B |

## Community
- RLChina reinforcement learning community: [http://rlchina.org/](http://rlchina.org/)
- BAAI Community (智源社区) reinforcement learning column: [https://hub.baai.ac.cn/?tag_id=74](https://hub.baai.ac.cn/?tag_id=74)
- BAAI Community (智源社区) reinforcement learning weekly: [https://hub.baai.ac.cn/users/18447](https://hub.baai.ac.cn/users/18447)

## Langya Rank
### Domestic Langya Rank
| Name | Organization | Link | Focus |
| ------ | ------------------ | --------------------------------------------------------- | ---------------------------------------- |
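| Hao Jianye (郝建业) | Tianjin University | [[HomePage](http://www.icdai.org/jianye.html)] | Multi-agent reinforcement learning, game theory |
| Zhang Haifeng (张海峰) | Institute of Automation, Chinese Academy of Sciences | [[HomePage](https://people.ucas.edu.cn/~zhf)] | Multi-agent reinforcement learning, agent game playing, agent evaluation |
| Luo Jun (罗军) | Huawei Noah's Ark Lab | [[HomePage](https://openreview.net/profile?id=~Jun_Luo1)] | Autonomous driving, reinforcement learning |
| Wang Xiangfeng (王祥丰) | East China Normal University | [[HomePage](https://mail-ecnu.cn/people/xfwang)] | Multi-agent reinforcement learning |
| Yu Yang (俞扬) | Nanjing University | [[HomePage](https://www.yuque.com/eyounx/home)] | Reinforcement learning, offline reinforcement learning |
| Yang Yaodong (杨耀东) | Peking University | [[HomePage](https://www.yangyaodong.com/)] | Multi-agent reinforcement learning, game theory |
| Lu Zongqing (卢宗青) | Peking University | [[HomePage](https://z0ngqing.github.io/)] | Reinforcement learning |
| Zhang Chongjie (张崇洁) | Tsinghua University | [[HomePage](http://people.iiis.tsinghua.edu.cn/~zhang/)] | Deep reinforcement learning, multi-agent systems |

### Abroad Langya Rank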
| Name | Organization | Link |
| -------------------- | ------------------------------------------------------------ | :----------------------------------------------------------: |
| Sergey Levine | UC Berkeley | [[Google Scholar](https://scholar.google.com/citations?user=8R35rCwAAAAJ&hl=zh-CN&oi=ao)] |
| Pieter Abbeel | UC Berkeley | [[Google Scholar](https://scholar.google.com/citations?user=vtwH6GkAAAAJ&hl=zh-CN)] |
| Matthew E. Taylor | University of Alberta | [[Google Scholar](https://scholar.google.com/citations?user=edQgLXcAAAAJ&hl=zh-CN&oi=ao)] |
| Peter Stone | University of Texas at Austin | [[Google Scholar](https://scholar.google.com/citations?user=qnwjcfAAAAAJ&hl=zh-CN&oi=ao)] |
| Shimon Whiteson | University of Oxford / Waymo | [[Google Scholar](https://scholar.google.com/citations?user=9zeEI-cAAAAJ&hl=zh-CN&oi=ao)] |
| Jan Peters | German AI Research Center | [[Google Scholar](https://scholar.google.com/citations?user=-kIVAcAAAAAJ&hl=zh-CN&oi=ao)] |
| Shie Mannor | Nvidia | [[Google Scholar](https://scholar.google.com/citations?user=q1HlbIUAAAAJ&hl=zh-CN&oi=ao)] |
| Chelsea Finn | Stanford University / Google | [[Google Scholar](https://scholar.google.com/citations?user=vfPE6hgAAAAJ&hl=zh-CN&oi=ao)] |
| Dusit Niyato | | [Google Scholar] |
| Doina Precup | DeepMind / McGill University | [[Google Scholar](https://scholar.google.com/citations?user=j54VcVEAAAAJ&hl=zh-CN&oi=ao)] |
| Ann Nowé | | [[Google Scholar](https://scholar.google.com/citations?user=LH5QKbgAAAAJ&hl=zh-CN&oi=ao)] |
| Marcello Restelli | Politecnico di Milano | [[Google Scholar](https://scholar.google.com/citations?user=xdgxRiEAAAAJ&hl=zh-CN&oi=ao)] |
| Frank L. Lewis | | [Google Scholar] |
| H. Vincent Poor | | [Google Scholar] |
| Vaneet Aggarwal | Purdue University | [[Google Scholar](https://scholar.google.com/citations?user=Tu4lmGwAAAAJ&hl=zh-CN&oi=ao)] |
| F. Richard Yu | Carleton University | [[Google Scholar](https://scholar.google.com/citations?user=zuGMGBoAAAAJ&hl=zh-CN&oi=ao)] |
| Jun Wang | University College London | [[Google Scholar](https://scholar.google.com/citations?hl=zh-CN&user=wIE1tY4AAAAJ)] |
| Michael L. Littman | | [Google Scholar] |
| Satinder Singh | University of Michigan | [Google Scholar] |
| Mehdi Bennis | | [Google Scholar] |
| David Silver | University College London / DeepMind | [Google Scholar] |
| Rémi Munos | | [Google Scholar] |
| Marc G. Bellemare | | [Google Scholar] |
| Joelle Pineau | McGill University / Meta AI | [[Google Scholar](https://scholar.google.com/citations?user=CEt6_mMAAAAJ&hl=zh-CN&oi=ao)] |
| Martin A. Riedmiller | Google | [[Google Scholar](https://scholar.google.com/citations?hl=zh-CN&user=1gVfqpcAAAAJ&view_op=list_works&sortby=pubdate)] |
| Mohsen Guizani | Mohamed Bin Zayed University of Artificial Intelligence | [[Google Scholar](https://scholar.google.com/citations?hl=zh-CN&user=RigrYkcAAAAJ&view_op=list_works&sortby=pubdate)] |
| Stefan Wermter | University of Hamburg | [[Google Scholar](https://scholar.google.com/citations?user=uIeaxuAAAAAJ&hl=zh-CN&oi=ao)] |
| Ying-Chang Liang | | [[Google Scholar](https://scholar.google.com/citations?user=HybIiJ8AAAAJ&hl=zh-CN&oi=ao)] |
| Jonathan P. How | | [Google Scholar] |
| Ivana Dusparic | Trinity College Dublin | [[Google Scholar](https://scholar.google.com/citations?user=CrGGAccAAAAJ&hl=zh-CN&oi=ao)] |
| Robert Babuska | Delft University of Technology / Czech Technical University Prague | [[Google Scholar](https://scholar.google.com/citations?user=0orN2FUAAAAJ&hl=zh-CN&oi=ao)] |
| Emma Brunskill | Stanford University | [[Google Scholar](https://scholar.google.com/citations?user=HaN8b2YAAAAJ&hl=zh-CN&oi=ao)] |
| Bo An | Nanyang Technological University | [[Google Scholar](https://scholar.google.com/citations?user=PEEpuNwAAAAJ&hl=zh-CN&oi=ao)] |