Projects in Awesome Lists tagged with a2c
A curated list of projects in awesome lists tagged with a2c.
https://github.com/thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
a2c atari bcq cql ddpg double-dqn dqn drl imitation-learning mujoco npg policy-gradient ppo pytorch rl sac td3 transferlab trpo
Last synced: 13 May 2025
https://github.com/vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
a2c actor-critic advantage-actor-critic ale atari deep-learning deep-reinforcement-learning gym machine-learning phasic-policy-gradient ppo proximal-policy-optimization python pytorch reinforcement-learning wandb
Last synced: 14 May 2025
https://github.com/andri27-ts/reinforcement-learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning
Last synced: 15 May 2025
https://github.com/sweetice/deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
a2c a3c actor-critic actor-critic-algorithm algorithm alphago deep-learning deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforce resnet sac sarsa td3 trpo
Last synced: 14 May 2025
https://github.com/ai4finance-foundation/elegantrl
Massively Parallel Deep Reinforcement Learning. 🔥
a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3
Last synced: 13 May 2025
https://github.com/simoninithomas/deep_reinforcement_learning_course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
a2c actor-critic deep-learning deep-q-learning deep-q-network deep-reinforcement-learning ppo pytorch qlearning tensorflow tensorflow-tutorials unity
Last synced: 14 May 2025
https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
a2c acktr actor-critic advantage-actor-critic ale atari continuous-control deep-learning deep-reinforcement-learning hessian kfac kronecker-factored-approximation mujoco natural-gradients ppo proximal-policy-optimization pytorch reinforcement-learning roboschool second-order
Last synced: 13 Apr 2025
https://github.com/shangtongzhang/deeprl
Modularized Implementation of Deep RL Algorithms in PyTorch
a2c categorical-dqn ddpg deep-reinforcement-learning deeprl double-dqn dqn dueling-network-architecture option-critic option-critic-architecture ppo prioritized-experience-replay pytorch quantile-regression rainbow td3
Last synced: 13 Apr 2025
https://github.com/seungeunrho/minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple
Last synced: 15 May 2025
https://github.com/kengz/slm-lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac
Last synced: 11 Feb 2026
https://github.com/khrylx/pytorch-rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo
Last synced: 12 Apr 2025
https://github.com/qfettes/deeprl-tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
a2c actor-critic advantage-actor-critic categorical-dqn deep-q-network deep-recurrent-q-network deep-reinforcement-learning deeprl-tutorials double-dqn dueling-dqn gae multi-step-learning noisy-networks ppo prioritized-experience-replay python3 pytorch quantile-regression rainbow reinforcement-learning
Last synced: 16 May 2025
https://github.com/agi-brain/xuance
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
a2c atari ddpg dqn google-research-football maddpg magent mappo mindspore mpe mujoco multi-agent-reinforcement-learning ppo pytorch qmix reinforcement-learning reinforcement-learning-library starcraft2 tensorflow2
Last synced: 27 May 2026
https://github.com/lcswillems/rl-starter-files
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
a2c a3c minigrid multi-process ppo preprocessed-observations pytorch reward-shaping
Last synced: 26 Oct 2025
https://github.com/TianhongDai/reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
a2c actor-critic algorithm atari2600 ddpg deep-learning deep-reinforcement-learning dqn dueling-dqn flappy-bird ppo proximal-policy-optimization pytorch sac soft-actor-critic trpo trust-region-policy-optimization
Last synced: 19 Jul 2025
https://github.com/cpnota/autonomous-learning-library
A PyTorch library for building deep reinforcement learning agents.
a2c advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-learning deep-reinforcement-learning dqn dqn-pytorch ppo proximal-policy-optimization reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic
Last synced: 11 Sep 2025
https://github.com/archsyscall/deeprl-tensorflow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
a2c a3c ddpg deep-learning deep-reinforcement-learning double-dqn dqn dueling-dqn machine-learning ppo rainbow-dqn reinforce reinforcement-learning sac tensorflow tensorflow2 trpo
Last synced: 05 Apr 2025
https://github.com/chenglongchen/pytorch-drl
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
a2c acktr actor-critic advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network deep-reinforcement-learning dqn drl madrl multi-agent ppo proximal-policy-optimization pytorch reinforcement-learning rl
Last synced: 05 Apr 2025
https://github.com/Omegastick/pytorch-cpp-rl
PyTorch C++ Reinforcement Learning
a2c actor-critic advantage-actor-critic continuous-control cplusplus cpp libtorch ppo proximal-policy-optimization pytorch pytorch-cpp-frontend pytorch-rl reinforcement-learning reinforcement-learning-algorithms
Last synced: 07 May 2025
https://github.com/dongminlee94/deep_rl
PyTorch implementation of deep reinforcement learning algorithms
a2c ddpg ddqn deep-reinforcement-learning dqn model-free-rl npg ppo pytorch sac sac-aea td3 trpo vpg
Last synced: 05 Apr 2025
https://github.com/sudharsan13296/deep-reinforcement-learning-with-python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
a2c a3c actor-critic bellman-equation c51 ddpg deep-learning deep-reinforcement-learning double-dqn dqn inverse-reinforcement-learning openai-gym policy-gradient ppo q-learning reinforcement-learning sac td3 trpo
Last synced: 05 Apr 2025
https://github.com/pythonlessons/reinforcement_learning
Reinforcement learning tutorials
a2c a3c actor-critic-algorythm bipedalwalker d3qn ddqn dqn dueling-dqn lunarlander policy-gradient ppo ppo-agent reinforcement-learning
Last synced: 06 Oct 2025
https://github.com/bentrevett/pytorch-rl
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
a2c actor-critic advantage-actor-critic generalized-advantage-estimation policy-gradient pytorch pytorch-implementation pytorch-implmention pytorch-rl pytorch-tutorial pytorch-tutorials reinforcement-learning reinforcement-learning-algorithms rl
Last synced: 27 Mar 2025
https://github.com/inoryy/tensorflow2-deep-reinforcement-learning
Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"
a2c advantage-actor-critic deep-reinforcement-learning keras tensorflow tensorflow2
Last synced: 08 May 2025
https://github.com/lcswillems/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
a2c a3c actor-critic advantage-actor-critic deep-reinforcement-learning minigrid multi-process ppo proximal-policy-optimization pytorch recurrent recurrent-neural-networks reinforcement-learning reward-shaping
Last synced: 10 Oct 2025
https://github.com/tobiasfshr/deep-reinforcement-learning-drone-control
A drone control system based on deep reinforcement learning with Tensorflow and ROS
a2c deep-reinforcement-learning drone-controller gazebo ros
Last synced: 03 Jan 2026
https://github.com/godka/pensieve-ppo
The simplest implementation of Pensieve (SIGCOMM '17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC, with support for both TensorFlow and PyTorch.
a2c deep-learning dqn pensieve ppo pytorch reinforcement-learning tensorflow
Last synced: 07 Apr 2025
https://github.com/nikhilbarhate99/actor-critic-pytorch
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
a2c actor-critic deep-reinforcement-learning openai-gym openai-gym-environments policy-gradient pytorch pytorch-implmention pytorch-tutorial reinforcement-learning-algorithms
Last synced: 13 Jul 2025
https://github.com/jcwleo/mario_rl
a2c actor-critic curiosity-driven deep-learning icm ppo pytorch reinforcement-learning supermariobros
Last synced: 03 Apr 2025
https://github.com/pawel-kieliszczyk/snake-reinforcement-learning
AI (A2C agent) mastering the game of Snake with TensorFlow 2.0
a2c artificial-intelligence keras machine-learning reinforcement-learning snake tensorflow
Last synced: 10 Apr 2025
https://github.com/abhilash1910/deep_reinforcement_learning_trading
Deep Reinforcement Learning for Trading
a2c a3c-agent actor-critic actor-critic-methods arima-model cnn-lstm-models ddpg double-dqn dqn-agents dueling-dqn ppo-agent prophet-model sac sarimax soft tensorflow2 time-series trading trading-strategies trpo
Last synced: 10 Apr 2025
https://github.com/ai-glimpse/toyrl
Reinforce learning is awesome!
a2c aiglimpse build-your-own-x double-dqn dqn ppo python3 reinforce reinforcement-learning sarsa toyrl
Last synced: 10 Jun 2025
https://github.com/lucadellalib/actorch
Deep reinforcement learning framework for fast prototyping based on PyTorch
a2c acktr actor-critic ddpg deep-learning deep-reinforcement-learning distributional-rl gymnasium machine-learning ppo python pytorch ray-tune reinforcement-learning retrace sac td3 trpo vtrace
Last synced: 13 Apr 2025
https://github.com/qfettes/curiositydrivenexplorationbyselfsupervisedprediction
Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch
a2c curiosity deep-learning deep-reinforcement-learning exploration gae neural-network python3 pytorch recurrent-neural-networks reinforcement-learning
Last synced: 10 Apr 2025
https://github.com/xffxff/endorphin
Like dopamine, but for different algorithms
a2c dopamine ppo pytorch reinforcement-learning
Last synced: 12 Apr 2025
https://github.com/rsgoksel/mechopter
PyGame-based quadcopter simulator & Reinforcement Learning Project
a2c dqn ipynb ppo pygame python quadcopter reinforcement-learning
Last synced: 28 Oct 2025
https://github.com/nashory/rl-tutorial-by-games
Game collection trained by SOTA reinforcement learning algorithms for tutorial
a2c atari deep-learning dqn game pytorch reinforcement-learning tutorial
Last synced: 14 Sep 2025
https://github.com/maik97/wacky-rl
Custom Reinforcement Learning Agents
a2c actor-critic deep-learning deep-reinforcement-learning dqn gym policy-gradient ppo proximal-policy-optimization reinforcement-learning rl-agents rl-algorithms sac soft-actor-critic
Last synced: 21 Sep 2025
https://github.com/weiyx16/rl_toy_example
Deep RL toy example based on gym package with several methods
a2c a3c continuous-mountain-car dqn-tensorflow gym mountain-car qlearning sarsa sarsa-lambda
Last synced: 15 Mar 2025
https://github.com/sophiaas/rlbase
Modular Deep RL infrastructure in PyTorch
a2c deep-reinforcement-learning hierarchical-reinforcement-learning option-critic policy-gradient ppo proximal-policy-optimization pytorch-rl reinforcement-learning
Last synced: 14 May 2026
https://github.com/mind/a2c
The OpenAI advantage actor critic (A2C) model
a2c openai openai-gym reinforcement-learning
Last synced: 08 May 2026
https://github.com/wondervictor/reinforcement.learning
Reinforcement Learning Playground
a2c ddpg deep-reinforcement-learning dqn dueling-network-architecture reinforcement-learning
Last synced: 17 Apr 2026
https://github.com/dylanzenner/stock-rl
This repo contains our project for CPE 800 which focuses on reinforcement learning for stock prediction
a2c alpaca-trading-api aws ddpg ensemble github-actions ppo reinforcement-learning stablebaselines3 td3 terraform terraform-cloud
Last synced: 12 Jan 2026
https://github.com/peiranli/rl
My RL agent in PyTorch
a2c continuous-versions ddpg policy-gradient ppo reinforcement-learning-algorithms
Last synced: 15 Jul 2025
https://github.com/yunjaechoi/rlagent
Reinforcement learning framework in tensorflow, compatible with OpenAI Gym like environments.
a2c ddpg reinforcement-learning tensorflow
Last synced: 11 May 2026
https://github.com/antoniszks/cartpole-optimization-with-reinforcement-learning
A university project in which we implement and experiment with different Reinforcement Learning algorithms and try to optimize the CartPole environment from OpenAI Gym.
a2c ai ai-agents dqn dueling-network-architecture gymnasium ml openai openai-gym ppo reinforcement-learning stable-baselines3 transformers
Last synced: 30 Apr 2026
https://github.com/ttitcombe/a3c
PyTorch implementation of Asynchronous (and Synchronous) Advantage Actor Critic
a2c a3c advantage-actor-critic pytorch pytorch-implementation reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Apr 2026
https://github.com/dhyanesh18/flappbird-rl
PPO and A2C agents for Flappybird. Includes scripts, training code, and evaluation tools.
a2c flappybird opencv ppo pygame-learning-environment reinforcement-learning stablebaselines3
Last synced: 06 May 2026
https://github.com/dmsovetov/reinforcement
Deep reinforcement learning experiments
a2c a3c deeplearning pytorch qlearning reinforce reinforcement-learning
Last synced: 07 May 2026
https://github.com/ethanmclark1/rl_toolkit
Implementation of core reinforcement learning algorithms with PyTorch
a2c deep-reinforcement-learning deep-rl dueling-dqn openai-gym policy-gradient pytorch q-table reinforcement-learning rl
Last synced: 16 Apr 2026
https://github.com/indutny/haggling_rl
RL learning model for Hola's Haggling Challenge
a2c haggle ppo reinforcement-learning
Last synced: 19 May 2026
https://github.com/starlitdreams/kungfu-a2c
Kung Fu Master A2C Agent is a PyTorch implementation of the Advantage Actor–Critic (A2C) algorithm, engineered to learn and play the Atari classic Kung Fu Master. It uses Gymnasium (with ALE ROMs) to manage multiple parallel environments and a custom preprocessing wrapper.
a2c a2c-agent actor-critic agentic-ai atari convolutional-neural-networks deep-reinforcement-learning gymnasium image-preprocessing kungfumaster python python3 pytorch reinforcement-learning
Last synced: 17 Apr 2026