awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning (updating)
https://github.com/tinyzqh/awesome-reinforcement-learning
Last synced: 5 days ago
JSON representation
-
Uncategorized
-
Uncategorized
- David Silver's course
- Deep RL Bootcamp
- Play pong with deep reinforcement learning based on pixel
- REINFORCEMENT LEARNING AND OPTIMAL CONTROL
- CS234: Reinforcement Learning Winter 2019
- 强化学习知识大讲堂
- 南京大学俞扬博士万字演讲全文:强化学习前沿
- OpenAi Spinning Up
- An Introduction to Deep Reinforcement Learning
- Foundations and Trends® in Machine Learning
- 中文翻译-2018秋季CS294-112深度强化学习
- CS294课程中文笔记-1
- 作业讲解
- David视频里所使用的讲义pdf
- John Schulmann's lectures
- CS 287: Advanced Robotics, Fall 2015
- Deep Learning (DLSS) and Reinforcement Learning (RLSS) Summer School, Montreal 2017
- 强化学习教程(莫烦)
- Deep Learning in a Nutshell: Reinforcement Learning
- AlphaGo
- 加州大学伯克利分校机器人学专家 Sergey Levine
- 加拿大阿尔伯塔大学著名增强学习大师Richard S. Sutton 教授
- Google DeepMind AlphaGo项目的主程序员 David Silver 博士
- 机器博弈专家Tuomas Sandholm教授
- 强化学习系列教程
- 强化学习系列教程
- Hands-On Reinforcement Learning With Python
- Reinforcement Learning: Theory and Python Implementation
- Advanced Deep Learning and Reinforcement Learning
- David Silver《深度强化学习》公开课教程学习笔记以及实战
- 南京大学俞扬博士万字演讲全文:强化学习前沿
- David Silver《深度强化学习》公开课教程学习笔记以及实战
- David Silver's course
- Play pong with deep reinforcement learning based on pixel
- Deep Learning (DLSS) and Reinforcement Learning (RLSS) Summer School, Montreal 2017
- CS294课程中文笔记-2 (记录较好)
- REINFORCEMENT LEARNING AND OPTIMAL CONTROL
- Deep Learning in a Nutshell: Reinforcement Learning
- Google DeepMind AlphaGo项目的主程序员 David Silver 博士
- 机器博弈专家Tuomas Sandholm教授
- CS 294: Deep Reinforcement Learning
-
-
论文
-
Implementation of Algorithms
- SAC
- PPO
- TRPO
- C51 - Atom DQN): Bellemare et al, 2017
- Dueling DQN
- Double DQN
- I2A - Augmented Agents): Weber et al, 2017
- AlphaZero-arxiv - Play) :Silver et al, 2017
- World Models
- MBMF - Based RL with Model-Free Fine-Tuning): Nagabandi et al, 2017
- MBVE - Based Value Expansion): Feinberg et al, 2018
- PathNet
- SAC - Policy Maximum Entropy): Haarnoja et al, 2018
- Policy distillation
- A2C / A3C - Critic): Mnih et al, 2016
- DDPG
- Prioritized experience replay
- DPG
- HER
- DQN-arxiv - Networks ): Mnih et al, 2013
- DQN-nature - Network ); Mnih et al, 2015
- QR-DQN
- Alpha Go
- AlphaZero-nature
- TD3
- NAF
- TCN - Contrastive Networks):Sermanet, et al, 2017
- Reinforcement and Imitation Learning
- Unifying Count-Based Exploration and Intrinsic Motivation
- Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
- Action-Conditional Video Prediction using Deep Networks in Atari Games
- Control of Memory, Active Perception, and Action in Minecraft
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- NAF
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- DQN-nature - Network ); Mnih et al, 2015
- AlphaZero-nature
- PathNet
- Reinforcement and Imitation Learning
- Unifying Count-Based Exploration and Intrinsic Motivation
- Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
- Action-Conditional Video Prediction using Deep Networks in Atari Games
- plannet
- AlphaZero-nature
- DPG
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
- DQN-nature - Network ); Mnih et al, 2015
-
-
Algorithm Repos
-
Project
-
Implementation of Algorithms
- DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
- SFV
- [code
- StarCraft II - pysc2 Deep Reinforcement Learning Examples
- An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
- Using reinforcement learning to teach a car to avoid obstacles
- SFV
- A reinforcement learning algorithm for the 2048 game
-
-
Awesome
- 强化学习从入门到放弃的资料
- Reinforcement learning resources curated
- This project is for learning and researching on Deep RL. Maintained by University AI researchers
- Reinforcement Learning Notebooks
- Awesome Reinforcement Learning(RL) for Natural Language Processing(NLP))
- TensorFlow implementation of Deep Reinforcement Learning papers
- A list of recent papers regarding deep reinforcement learning
- Paper list of multi-agent reinforcement learning (MARL) )
- Deep Reinforcement Learning(深度强化学习)
-
强化学习实战资源
-
Implementation of Algorithms
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
- PyTorch 实现 DQN, AC, A2C, A3C, , Policy Gradient, DDPG, TRPO, PPO, ACER
- Repo for the Deep Reinforcement Learning Nanodegree program
- Deep Reinforcement learning framework
- PyTorch implementations of various DRL algorithms for both single agent and multi-agent
- Codes for understanding Reinforcement Learning( updating... )
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
- 教程 | 如何在Unity环境中用强化学习训练Donkey Car
- 深入浅出解读"多巴胺(Dopamine)论文"、环境配置和实例分析
-
Programming Languages
Sub Categories
Keywords
reinforcement-learning
9
deep-reinforcement-learning
9
pytorch
5
tensorflow
4
ppo
4
machine-learning
3
openai-gym
3
policy-gradient
3
dqn
3
deep-q-network
3
actor-critic
2
double-dqn
2
multiagent-reinforcement-learning
2
deep-recurrent-q-network
2
alphago
2
gym
2
reinforcement-learning-algorithms
2
python
2
a2c
2
dueling-dqn
2
trpo
2
sarsa
2
q-learning
2
neural-networks
2
drqn
1
deep-learning-algorithms
1
deep-deterministic-policy-gradient
1
asynchronous-advantage-actor-critic
1
toolbox
1
hindsight-experience-replay
1
markov-decision-processes
1
keras
1
theano
1
aaai
1
aamas
1
agi
1
aistats
1
artificial-general-intelligence
1
distributional
1
exploration-exploitation
1
game
1
hierarchical-reinforcement-learning
1
iclr
1
icml
1
ijcai
1
inverse-rl
1
planning
1
reward
1
theoretical-computer-science
1
uai
1