Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-rl

Awesome RL: Papers, Books, Codes, Benchmarks
https://github.com/dbobrenko/awesome-rl

[Stable Baselines3
[Baselines @ OpenAI
[Baselines @ DLR-RM
[RLlib @ Ray
[Dopamine @ Google
[TensorForce
[pytorch-a2c-ppo-acktr
[OpenAI Benchmarks for PPO, A2C, ACKTR, ACER
[OpenAI Benchmarks for DQN, Double DQN, Dueling DQN, Prioritized DQN
[Google Benchmarks for Rainbow, c51, IQN, DQN
[Soft Actor Critic - actor-critic-deep-reinforcement.html)] [[code](https://github.com/rail-berkeley/softlearning/)] 2018 @ Google Brain, UC Berkeley
[IMPALA
[Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR, A2C)
[Proximal Policy Optimization Algorithms (PPO) - baselines-ppo/)] 2017 @ OpenAI
[High-dimensional continuous control using generalized advantage estimation (GAE)
[Trust Region Policy Optimization (TRPO)
[Actor-Critic Algorithms, pdf
[Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning (REINFORCE), pdf
[Implicit Quantile Networks for Distributional Reinforcement Learning (IQN)
[A Distributional Perspective on Reinforcement Learning (c51)
[Rainbow: Combining Improvements in Deep Reinforcement Learning
[Dueling Network Architectures for Deep Reinforcement Learning (Dueling DQN)
[Deep Reinforcement Learning with Double Q-learning (Double DQN)
[Playing Atari with Deep Reinforcement Learning** (DQN)
[Temporal Difference Learning and TD-Gammon, pdf
[Model-Based Reinforcement Learning for Atari
navigation
locomotion - based-rl/)] [[code](https://github.com/nagaban2/nn_dynamics)] 2017 @ Berkeley
locomotion - imagine-and-plan/)] 2017 @ Google DeepMind
navigation
[Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
locomotion
[Evolution Strategies as a Scalable Alternative to Reinforcement Learning
[Evolving Large-Scale Neural Networks for Vision-Based Reinforcement Learning, pdf - SUPSI
[Go-Explore
[Exploration by Random Network Distillation (RND) - learning-with-prediction-based-rewards/)] [[code](https://github.com/openai/random-network-distillation)] 2018 @ OpenAI
navigation - scale-curiosity/)] 2018 @ OpenAI, Berkeley, Univ. of Edinburgh
[RUDDER: Return Decomposition for Delayed Rewards - jku/baselines-rudder)] 2018 @ Johannes Kepler Univ. Linz
[Deep Curiosity Search
locomotion
transfer - imagine-and-plan/)] 2017 @ DeepMind
table
table - zero-learning-scratch/)] Silver et al., 2017 @ Deepmind
table
locomotion - a-hierarchy/)] Frans et al., 2017 @ OpenAI, Berkeley.
[Hybrid Reward Architecture for Reinforcement Learning (HRA)
[Learning with Opponent-Learning Awareness (LOLA) - to-model-other-minds/)] Foerster et al., 2017 @ OpenAI, Oxford, Berkeley, CMU
manipulation
manipulation
manipulation
[Learning to Navigate in Cities Without a Map
[Human-level performance in first-person multiplayer games with population-based deep reinforcement learning - the-flag/)] Jaderberg et al, 2018 @ DeepMind
generalization
[Learning to Navigate in Complex Environments
transfer
meta-learning
[Learning to act by predicting the future (VizDoom 2016 Full DM Winner)
[Playing FPS Games with Deep Reinforcement Learning (VizDoom 2016 Limited DM 2nd place)
generalization - dexterity/)] Andrychowicz et al., 2018 @ OpenAI
generalization - from-simulation/)] Pinto et al., 2017 @ OpenAI, CMU
generalization - from-simulation/)] Peng et al., 2017 @ OpenAI, Berkeley
[Emergence of Locomotion Behaviours in Rich Environments - flexible-behaviours-simulated-environments/)] Heess et al., 2017 @ DeepMind
[Programmable Agents
[AutoAugment: Learning Augmentation Policies from Data
evolution
[Learning Transferable Architectures for Scalable Image Recognition
[Neural Optimizer Search with Reinforcement Learning, pdf
[Neural Architecture Search with Reinforcement Learning
[A Deep Reinforcement Learning Chatbot
[Reinforcement Learning: An Introduction, pdf
[A Brief Survey of Deep Reinforcement Learning
[How to Read a Paper
Transfromers: [Attention is all you need

Programming Languages

Keywords

pytorch 2 reinforcement-learning 2 tensorflow 1 rl 1 ml 1 google 1 ai 1 toolbox 1 stable-baselines 1 sde 1 sb3 1 robotics 1 reinforcement-learning-algorithms 1 python 1 openai 1 machine-learning 1 gym 1 gsde 1 second-order 1 roboschool 1 proximal-policy-optimization 1 ppo 1 natural-gradients 1 mujoco 1 kronecker-factored-approximation 1 kfac 1 hessian 1 deep-reinforcement-learning 1 deep-learning 1 continuous-control 1 atari 1 ale 1 advantage-actor-critic 1 actor-critic 1 acktr 1 a2c 1 baselines 1