awesome-reinforcement-learning
  
  
    A curated list of awesome frameworks, libraries, tools, environments, tutorials, research papers, and resources for reinforcement learning (RL). This list covers fundamental concepts, advanced algorithms, applications, and popular frameworks for building RL models. 
    https://github.com/awesomelistsio/awesome-reinforcement-learning
  
        Last synced: 1 day ago 
        JSON representation
    
- 
            Learning Resources- DeepMind’s RL Course - A comprehensive RL course by DeepMind researchers.
- Deep Reinforcement Learning Nanodegree (Udacity) - A program focused on deep RL techniques.
- Coursera: Reinforcement Learning Specialization - A series of courses on RL by the University of Alberta.
- Deep Reinforcement Learning Nanodegree (Udacity) - A program focused on deep RL techniques.
 
- 
            Core Algorithms- SARSA (State-Action-Reward-State-Action) - An on-policy RL algorithm.
- Deep Q-Learning (DQN) - A value-based method using deep learning to approximate the Q-value function.
- REINFORCE Algorithm - A Monte Carlo policy gradient method for training RL agents.
- Deep Q-Learning (DQN) - A value-based method using deep learning to approximate the Q-value function.
- SARSA (State-Action-Reward-State-Action) - An on-policy RL algorithm.
- REINFORCE Algorithm - A Monte Carlo policy gradient method for training RL agents.
- Policy Gradient Methods - A class of algorithms that directly optimize the policy.
- Actor-Critic Methods - Algorithms that use both policy (actor) and value (critic) functions.
 
- 
            Frameworks and Libraries- OpenAI Baselines - A collection of high-quality implementations of RL algorithms by OpenAI.
- Ray RLlib - A scalable reinforcement learning library built on top of Ray.
- Dopamine - A research framework by Google focused on fast prototyping of RL algorithms.
- TF-Agents - A library for reinforcement learning using TensorFlow.
- Ray RLlib - A scalable reinforcement learning library built on top of Ray.
- TF-Agents - A library for reinforcement learning using TensorFlow.
- OpenAI Baselines - A collection of high-quality implementations of RL algorithms by OpenAI.
- Dopamine - A research framework by Google focused on fast prototyping of RL algorithms.
- Acme - A library by DeepMind for building and testing reinforcement learning agents.
 
- 
            Tools and Environments- OpenAI Gym - A toolkit for developing and comparing RL algorithms with a variety of environments.
- Unity ML-Agents - A toolkit by Unity for training intelligent agents using RL.
- PyBullet - An open-source Python module for physics simulations in RL.
- PettingZoo - A library of multi-agent reinforcement learning environments.
- CARLA Simulator - An open-source simulator for autonomous driving research using RL.
- OpenAI Gym - A toolkit for developing and comparing RL algorithms with a variety of environments.
- Unity ML-Agents - A toolkit by Unity for training intelligent agents using RL.
- PyBullet - An open-source Python module for physics simulations in RL.
- PettingZoo - A library of multi-agent reinforcement learning environments.
- CARLA Simulator - An open-source simulator for autonomous driving research using RL.
 
- 
            Advanced Algorithms- Trust Region Policy Optimization (TRPO) - An algorithm designed to maintain stable updates of the policy.
- Proximal Policy Optimization (PPO) - A stable and efficient policy optimization method.
- Deep Deterministic Policy Gradient (DDPG) - An off-policy algorithm for continuous action spaces.
- Soft Actor-Critic (SAC) - An entropy-regularized algorithm for stable learning in continuous action spaces.
- Proximal Policy Optimization (PPO) - A stable and efficient policy optimization method.
- Deep Deterministic Policy Gradient (DDPG) - An off-policy algorithm for continuous action spaces.
- Soft Actor-Critic (SAC) - An entropy-regularized algorithm for stable learning in continuous action spaces.
- Trust Region Policy Optimization (TRPO) - An algorithm designed to maintain stable updates of the policy.
- A3C (Asynchronous Advantage Actor-Critic) - An efficient, asynchronous RL algorithm for training agents.
 
            Programming Languages
          
          
        
            Categories
          
          
        
            Sub Categories