Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-reinforcement-learning

Learning resources and links for reinforcement learning (updating)
https://github.com/tinyzqh/awesome-reinforcement-learning

Last synced: 3 days ago

Uncategorized
- Uncategorized
Papers
- Implementation of Algorithms
  - DQN-nature (Deep Q-Network): Mnih et al, 2015
  - AlphaZero-nature
  - Control of Memory, Active Perception, and Action in Minecraft
  - TCN (Time-Contrastive Networks): Sermanet et al, 2017
  - Reinforcement and Imitation Learning
  - Prioritized experience replay
  - DQN-arxiv (Deep Q-Networks): Mnih et al, 2013
  - A2C / A3C (Asynchronous Advantage Actor-Critic): Mnih et al, 2016
  - Double DQN
  - Dueling DQN
  - QR-DQN
  - Alpha Go
  - SAC (Soft Actor-Critic, Off-Policy Maximum Entropy): Haarnoja et al, 2018
  - SAC
  - PPO
  - TRPO
  - DPG
  - DDPG
  - TD3
  - NAF
  - C51 (Categorical 51-Atom DQN): Bellemare et al, 2017
  - HER
  - World Models
  - PathNet
  - Unifying Count-Based Exploration and Intrinsic Motivation
  - Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
  - Action-Conditional Video Prediction using Deep Networks in Atari Games
  - I2A (Imagination-Augmented Agents): Weber et al, 2017
  - MBMF (Model-Based RL with Model-Free Fine-Tuning): Nagabandi et al, 2017
  - MBVE (Model-Based Value Expansion): Feinberg et al, 2018
  - Policy distillation
  - AlphaZero-arxiv (Self-Play): Silver et al, 2017
Hands-On Reinforcement Learning Resources
- Implementation of Algorithms
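  - Tutorial | How to Train a Donkey Car with Reinforcement Learning in the Unity Environment
  - An Accessible Walkthrough of the Dopamine Paper, Environment Setup, and Worked Examples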
Awesome
- Reinforcement Learning Materials: From Getting Started to Giving Up
Algorithm Repos
Project
- Implementation of Algorithms

Programming Languages

HTML 1 Jupyter Notebook 1

Categories

Papers 98 Uncategorized 30 Project 3 Algorithm Repos 3 Hands-On Reinforcement Learning Resources 2 Awesome 1

Sub Categories

Implementation of Algorithms 103 Uncategorized 30

Keywords

deep-reinforcement-learning 2 reinforcement-learning 2 openai-gym 2 asynchronous-advantage-actor-critic 1 deep-deterministic-policy-gradient 1 deep-learning-algorithms 1 deep-q-network 1 deep-recurrent-q-network 1 double-dqn 1 drqn 1 dueling-dqn 1 hindsight-experience-replay 1 markov-decision-processes 1 monte-carlo 1 policy-gradient 1 policy-gradients 1 ppo 1 q-learning 1 sarsa 1 trpo 1 gym 1 python 1 pytorch 1 tensorflow 1 tensorflow2 1