Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-rl
Awesome RL: Papers, Books, Codes, Benchmarks
https://github.com/dbobrenko/awesome-rl
- [Stable Baselines3
- [Baselines @ OpenAI
- [Baselines @ DLR-RM
- [RLlib @ Ray
- [Dopamine @ Google
- [TensorForce
- [pytorch-a2c-ppo-acktr
- [OpenAI Benchmarks for PPO, A2C, ACKTR, ACER
- [OpenAI Benchmarks for DQN, Double DQN, Dueling DQN, Prioritized DQN
- [Google Benchmarks for Rainbow, c51, IQN, DQN
- [Soft Actor Critic - actor-critic-deep-reinforcement.html)] [[code](https://github.com/rail-berkeley/softlearning/)] 2018 @ Google Brain, UC Berkeley
- [IMPALA
- [Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR, A2C)
- [Proximal Policy Optimization Algorithms (PPO) - baselines-ppo/)] 2017 @ OpenAI
- [High-dimensional continuous control using generalized advantage estimation (GAE)
- [Trust Region Policy Optimization (TRPO)
- [Actor-Critic Algorithms, pdf
- [Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning (REINFORCE), pdf
- [Implicit Quantile Networks for Distributional Reinforcement Learning (IQN)
- [A Distributional Perspective on Reinforcement Learning (c51)
- [Rainbow: Combining Improvements in Deep Reinforcement Learning
- [Dueling Network Architectures for Deep Reinforcement Learning (Dueling DQN)
- [Deep Reinforcement Learning with Double Q-learning (Double DQN)
- [Playing Atari with Deep Reinforcement Learning** (DQN)
- [Temporal Difference Learning and TD-Gammon, pdf
- [Model-Based Reinforcement Learning for Atari
- navigation
- locomotion - based-rl/)] [[code](https://github.com/nagaban2/nn_dynamics)] 2017 @ Berkeley
- locomotion - imagine-and-plan/)] 2017 @ Google DeepMind
- navigation
- [Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
- locomotion
- [Evolution Strategies as a Scalable Alternative to Reinforcement Learning
- [Evolving Large-Scale Neural Networks for Vision-Based Reinforcement Learning, pdf - SUPSI
- [Go-Explore
- [Exploration by Random Network Distillation (RND) - learning-with-prediction-based-rewards/)] [[code](https://github.com/openai/random-network-distillation)] 2018 @ OpenAI
- navigation - scale-curiosity/)] 2018 @ OpenAI, Berkeley, Univ. of Edinburgh
- [RUDDER: Return Decomposition for Delayed Rewards - jku/baselines-rudder)] 2018 @ Johannes Kepler Univ. Linz
- [Deep Curiosity Search
- locomotion
- transfer - imagine-and-plan/)] 2017 @ DeepMind
- table
- table - zero-learning-scratch/)] Silver et al., 2017 @ Deepmind
- table
- locomotion - a-hierarchy/)] Frans et al., 2017 @ OpenAI, Berkeley.
- [Hybrid Reward Architecture for Reinforcement Learning (HRA)
- [Learning with Opponent-Learning Awareness (LOLA) - to-model-other-minds/)] Foerster et al., 2017 @ OpenAI, Oxford, Berkeley, CMU
- manipulation
- manipulation
- manipulation
- [Learning to Navigate in Cities Without a Map
- [Human-level performance in first-person multiplayer games with population-based deep reinforcement learning - the-flag/)] Jaderberg et al, 2018 @ DeepMind
- generalization
- [Learning to Navigate in Complex Environments
- transfer
- meta-learning
- [Learning to act by predicting the future (VizDoom 2016 Full DM Winner)
- [Playing FPS Games with Deep Reinforcement Learning (VizDoom 2016 Limited DM 2nd place)
- generalization - dexterity/)] Andrychowicz et al., 2018 @ OpenAI
- generalization - from-simulation/)] Pinto et al., 2017 @ OpenAI, CMU
- generalization - from-simulation/)] Peng et al., 2017 @ OpenAI, Berkeley
- [Emergence of Locomotion Behaviours in Rich Environments - flexible-behaviours-simulated-environments/)] Heess et al., 2017 @ DeepMind
- [Programmable Agents
- [AutoAugment: Learning Augmentation Policies from Data
- evolution
- [Learning Transferable Architectures for Scalable Image Recognition
- [Neural Optimizer Search with Reinforcement Learning, pdf
- [Neural Architecture Search with Reinforcement Learning
- [A Deep Reinforcement Learning Chatbot
- [Reinforcement Learning: An Introduction, pdf
- [A Brief Survey of Deep Reinforcement Learning
- [How to Read a Paper
- Transfromers: [Attention is all you need
Programming Languages
Keywords
pytorch
2
reinforcement-learning
2
tensorflow
1
rl
1
ml
1
google
1
ai
1
toolbox
1
stable-baselines
1
sde
1
sb3
1
robotics
1
reinforcement-learning-algorithms
1
python
1
openai
1
machine-learning
1
gym
1
gsde
1
second-order
1
roboschool
1
proximal-policy-optimization
1
ppo
1
natural-gradients
1
mujoco
1
kronecker-factored-approximation
1
kfac
1
hessian
1
deep-reinforcement-learning
1
deep-learning
1
continuous-control
1
atari
1
ale
1
advantage-actor-critic
1
actor-critic
1
acktr
1
a2c
1
baselines
1