Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning (updating)
https://github.com/tinyzqh/awesome-reinforcement-learning
Last synced: 3 days ago
JSON representation
-
Uncategorized
-
Uncategorized
- Hands-On Reinforcement Learning With Python
- Reinforcement Learning: Theory and Python Implementation
- CS294课程中文笔记-1
- 加州大学伯克利分校机器人学专家 Sergey Levine
- Google DeepMind AlphaGo项目的主程序员 David Silver 博士
- 机器博弈专家Tuomas Sandholm教授
- 强化学习系列教程
- 南京大学俞扬博士万字演讲全文:强化学习前沿
- OpenAi Spinning Up
- An Introduction to Deep Reinforcement Learning
- Foundations and Trends® in Machine Learning
- REINFORCEMENT LEARNING AND OPTIMAL CONTROL
- David Silver's course
- David视频里所使用的讲义pdf
- John Schulmann's lectures
- Deep RL Bootcamp
- CS 287: Advanced Robotics, Fall 2015
- CS234: Reinforcement Learning Winter 2019
- Deep Learning (DLSS) and Reinforcement Learning (RLSS) Summer School, Montreal 2017
- Play pong with deep reinforcement learning based on pixel
- Deep Learning in a Nutshell: Reinforcement Learning
- AlphaGo
- 加拿大阿尔伯塔大学著名增强学习大师Richard S. Sutton 教授
- 强化学习知识大讲堂
- 强化学习系列教程
- 强化学习教程(莫烦)
- 作业讲解
- David Silver《深度强化学习》公开课教程学习笔记以及实战
- 中文翻译-2018秋季CS294-112深度强化学习
- 强化学习系列教程
-
-
论文
-
Implementation of Algorithms
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- Control of Memory, Active Perception, and Action in Minecraft
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- TCN - Contrastive Networks):Sermanet, et al, 2017
- Reinforcement and Imitation Learning
- Prioritized experience replay
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- AlphaZero-nature
- DQN-arxiv - Networks ): Mnih et al, 2013
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- A2C / A3C - Critic): Mnih et al, 2016
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- Double DQN
- Dueling DQN
- QR-DQN
- Alpha Go
- AlphaZero-nature
- SAC - Policy Maximum Entropy): Haarnoja et al, 2018
- SAC
- PPO
- TRPO
- DPG
- DDPG
- TD3
- NAF
- C51 - Atom DQN): Bellemare et al, 2017
- HER
- World Models
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- PathNet
- Reinforcement and Imitation Learning
- Unifying Count-Based Exploration and Intrinsic Motivation
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
- Action-Conditional Video Prediction using Deep Networks in Atari Games
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- I2A - Augmented Agents): Weber et al, 2017
- MBMF - Based RL with Model-Free Fine-Tuning): Nagabandi et al, 2017
- MBVE - Based Value Expansion): Feinberg et al, 2018
- PathNet
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- Unifying Count-Based Exploration and Intrinsic Motivation
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
- Action-Conditional Video Prediction using Deep Networks in Atari Games
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- Policy distillation
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-arxiv - Play) :Silver et al, 2017
- AlphaZero-nature
- NAF
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
-
-
强化学习实战资源
-
Implementation of Algorithms
-
-
Awesome
-
Algorithm Repos
-
Project
-
Implementation of Algorithms
-
Programming Languages
Sub Categories
Keywords
deep-reinforcement-learning
2
reinforcement-learning
2
openai-gym
2
asynchronous-advantage-actor-critic
1
deep-deterministic-policy-gradient
1
deep-learning-algorithms
1
deep-q-network
1
deep-recurrent-q-network
1
double-dqn
1
drqn
1
dueling-dqn
1
hindsight-experience-replay
1
markov-decision-processes
1
monte-carlo
1
policy-gradient
1
policy-gradients
1
ppo
1
q-learning
1
sarsa
1
trpo
1
gym
1
python
1
pytorch
1
tensorflow
1
tensorflow2
1