Projects in Awesome Lists tagged with a2c

https://github.com/thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

a2c atari bcq cql ddpg double-dqn dqn drl imitation-learning mujoco npg policy-gradient ppo pytorch rl sac td3 transferlab trpo

Last synced: 13 May 2025

https://github.com/vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

a2c actor-critic advantage-actor-critic ale atari deep-learning deep-reinforcement-learning gym machine-learning phasic-policy-gradient ppo proximal-policy-optimization python pytorch reinforcement-learning wandb

Last synced: 14 May 2025

https://github.com/andri27-ts/reinforcement-learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning

Last synced: 15 May 2025

https://github.com/andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning

Last synced: 15 Mar 2025

https://github.com/sweetice/deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

a2c a3c actor-critic actor-critic-algorithm algorithm alphago deep-learning deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforce resnet sac sarsa td3 trpo

Last synced: 14 May 2025

https://github.com/ai4finance-foundation/elegantrl

Massively Parallel Deep Reinforcement Learning. 🔥

a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3

Last synced: 13 May 2025

https://github.com/AI4Finance-Foundation/ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3

Last synced: 02 Apr 2025

https://github.com/sweetice/Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

a2c a3c actor-critic actor-critic-algorithm algorithm alphago deep-learning deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforce resnet sac sarsa td3 trpo

Last synced: 01 May 2025

https://github.com/simoninithomas/deep_reinforcement_learning_course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

a2c actor-critic deep-learning deep-q-learning deep-q-network deep-reinforcement-learning ppo pytorch qlearning tensorflow tensorflow-tutorials unity

Last synced: 14 May 2025

https://github.com/simoninithomas/Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

a2c actor-critic deep-learning deep-q-learning deep-q-network deep-reinforcement-learning ppo pytorch qlearning tensorflow tensorflow-tutorials unity

Last synced: 19 Jul 2025

https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

a2c acktr actor-critic advantage-actor-critic ale atari continuous-control deep-learning deep-reinforcement-learning hessian kfac kronecker-factored-approximation mujoco natural-gradients ppo proximal-policy-optimization pytorch reinforcement-learning roboschool second-order

Last synced: 13 Apr 2025

https://github.com/shangtongzhang/deeprl

Modularized Implementation of Deep RL Algorithms in PyTorch

a2c categorical-dqn ddpg deep-reinforcement-learning deeprl double-dqn dqn dueling-network-architecture option-critic option-critic-architecture ppo prioritized-experience-replay pytorch quantile-regression rainbow td3

Last synced: 13 Apr 2025

https://github.com/ShangtongZhang/DeepRL

Modularized Implementation of Deep RL Algorithms in PyTorch

a2c categorical-dqn ddpg deep-reinforcement-learning deeprl double-dqn dqn dueling-network-architecture option-critic option-critic-architecture ppo prioritized-experience-replay pytorch quantile-regression rainbow td3

Last synced: 01 Apr 2025

https://github.com/seungeunrho/minimalrl

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple

Last synced: 15 May 2025

https://github.com/seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple

Last synced: 03 Apr 2025

https://github.com/kengz/slm-lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 11 Feb 2026

https://github.com/kengz/SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 01 Apr 2025

https://github.com/khrylx/pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo

Last synced: 12 Apr 2025

https://github.com/Khrylx/PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo

Last synced: 29 Apr 2025

https://github.com/qfettes/deeprl-tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

a2c actor-critic advantage-actor-critic categorical-dqn deep-q-network deep-recurrent-q-network deep-reinforcement-learning deeprl-tutorials double-dqn dueling-dqn gae multi-step-learning noisy-networks ppo prioritized-experience-replay python3 pytorch quantile-regression rainbow reinforcement-learning

Last synced: 16 May 2025

https://github.com/agi-brain/xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

a2c atari ddpg dqn google-research-football maddpg magent mappo mindspore mpe mujoco multi-agent-reinforcement-learning ppo pytorch qmix reinforcement-learning reinforcement-learning-library starcraft2 tensorflow2

Last synced: 27 May 2026

https://github.com/qfettes/DeepRL-Tutorials

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

a2c actor-critic advantage-actor-critic categorical-dqn deep-q-network deep-recurrent-q-network deep-reinforcement-learning deeprl-tutorials double-dqn dueling-dqn gae multi-step-learning noisy-networks ppo prioritized-experience-replay python3 pytorch quantile-regression rainbow reinforcement-learning

Last synced: 01 May 2025

https://github.com/lcswillems/rl-starter-files

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

a2c a3c minigrid multi-process ppo preprocessed-observations pytorch reward-shaping

Last synced: 26 Oct 2025

https://github.com/TianhongDai/reinforcement-learning-algorithms

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

a2c actor-critic algorithm atari2600 ddpg deep-learning deep-reinforcement-learning dqn dueling-dqn flappy-bird ppo proximal-policy-optimization pytorch sac soft-actor-critic trpo trust-region-policy-optimization

Last synced: 19 Jul 2025

https://github.com/cpnota/autonomous-learning-library

A PyTorch library for building deep reinforcement learning agents.

a2c advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-learning deep-reinforcement-learning dqn dqn-pytorch ppo proximal-policy-optimization reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic

Last synced: 11 Sep 2025

https://github.com/archsyscall/deeprl-tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

a2c a3c ddpg deep-learning deep-reinforcement-learning double-dqn dqn dueling-dqn machine-learning ppo rainbow-dqn reinforce reinforcement-learning sac tensorflow tensorflow2 trpo

Last synced: 05 Apr 2025

https://github.com/archsyscall/DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

a2c a3c ddpg deep-learning deep-reinforcement-learning double-dqn dqn dueling-dqn machine-learning ppo rainbow-dqn reinforce reinforcement-learning sac tensorflow tensorflow2 trpo

Last synced: 15 Oct 2025

https://github.com/chenglongchen/pytorch-drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

a2c acktr actor-critic advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network deep-reinforcement-learning dqn drl madrl multi-agent ppo proximal-policy-optimization pytorch reinforcement-learning rl

Last synced: 05 Apr 2025

https://github.com/Omegastick/pytorch-cpp-rl

PyTorch C++ Reinforcement Learning

a2c actor-critic advantage-actor-critic continuous-control cplusplus cpp libtorch ppo proximal-policy-optimization pytorch pytorch-cpp-frontend pytorch-rl reinforcement-learning reinforcement-learning-algorithms

Last synced: 07 May 2025

https://github.com/dongminlee94/deep_rl

PyTorch implementation of deep reinforcement learning algorithms

a2c ddpg ddqn deep-reinforcement-learning dqn model-free-rl npg ppo pytorch sac sac-aea td3 trpo vpg

Last synced: 05 Apr 2025

https://github.com/sudharsan13296/deep-reinforcement-learning-with-python

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

a2c a3c actor-critic bellman-equation c51 ddpg deep-learning deep-reinforcement-learning double-dqn dqn inverse-reinforcement-learning openai-gym policy-gradient ppo q-learning reinforcement-learning sac td3 trpo

Last synced: 05 Apr 2025

https://github.com/pythonlessons/reinforcement_learning

Reinforcement learning tutorials

a2c a3c actor-critic-algorythm bipedalwalker d3qn ddqn dqn dueling-dqn lunarlander policy-gradient ppo ppo-agent reinforcement-learning

Last synced: 06 Oct 2025

https://github.com/bentrevett/pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

a2c actor-critic advantage-actor-critic generalized-advantage-estimation policy-gradient pytorch pytorch-implementation pytorch-implmention pytorch-rl pytorch-tutorial pytorch-tutorials reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 27 Mar 2025

https://github.com/inoryy/tensorflow2-deep-reinforcement-learning

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

a2c advantage-actor-critic deep-reinforcement-learning keras tensorflow tensorflow2

Last synced: 08 May 2025

https://github.com/lcswillems/torch-ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

a2c a3c actor-critic advantage-actor-critic deep-reinforcement-learning minigrid multi-process ppo proximal-policy-optimization pytorch recurrent recurrent-neural-networks reinforcement-learning reward-shaping

Last synced: 10 Oct 2025

https://github.com/tobiasfshr/deep-reinforcement-learning-drone-control

A drone control system based on deep reinforcement learning with Tensorflow and ROS

a2c deep-reinforcement-learning drone-controller gazebo ros

Last synced: 03 Jan 2026

https://github.com/godka/pensieve-ppo

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, SAC, and support for both TensorFlow and PyTorch.

a2c deep-learning dqn pensieve ppo pytorch reinforcement-learning tensorflow

Last synced: 07 Apr 2025

https://github.com/nikhilbarhate99/actor-critic-pytorch

Policy Gradient Actor-Critic PyTorch | Lunar Lander v2

a2c actor-critic deep-reinforcement-learning openai-gym openai-gym-environments policy-gradient pytorch pytorch-implmention pytorch-tutorial reinforcement-learning-algorithms

Last synced: 13 Jul 2025

https://github.com/jcwleo/mario_rl

a2c actor-critic curiosity-driven deep-learning icm ppo pytorch reinforcement-learning supermariobros

Last synced: 03 Apr 2025

https://github.com/alok/rl_implementations

a2c actor-critic ddpg deep-learning deep-q-learning deep-q-network deep-reinforcement-learning dqn hierarchical-reinforcement-learning lstm machine-learning meta-learning policy-gradient reinforcement-learning reptile

Last synced: 05 May 2025

https://github.com/pawel-kieliszczyk/snake-reinforcement-learning

AI (A2C agent) mastering the game of Snake with TensorFlow 2.0

a2c artificial-intelligence keras machine-learning reinforcement-learning snake tensorflow

Last synced: 10 Apr 2025

https://github.com/abhilash1910/deep_reinforcement_learning_trading

Deep Reinforcement Learning for Trading

a2c a3c-agent actor-critic actor-critic-methods arima-model cnn-lstm-models ddpg double-dqn dqn-agents dueling-dqn ppo-agent prophet-model sac sarimax soft tensorflow2 time-series trading trading-strategies trpo

Last synced: 10 Apr 2025

https://github.com/ai-glimpse/toyrl

Reinforce learning is awesome!

a2c aiglimpse build-your-own-x double-dqn dqn ppo python3 reinforce reinforcement-learning sarsa toyrl

Last synced: 10 Jun 2025

https://github.com/jianzhnie/rltoolkit

RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

a2c actor-critic ddpg ddqn dqn maddpg mappo ppo qmix rl sac td3 trpo

Last synced: 03 Aug 2025

https://github.com/lucadellalib/actorch

Deep reinforcement learning framework for fast prototyping based on PyTorch

a2c acktr actor-critic ddpg deep-learning deep-reinforcement-learning distributional-rl gymnasium machine-learning ppo python pytorch ray-tune reinforcement-learning retrace sac td3 trpo vtrace

Last synced: 13 Apr 2025

https://github.com/qfettes/curiositydrivenexplorationbyselfsupervisedprediction

Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch

a2c curiosity deep-learning deep-reinforcement-learning exploration gae neural-network python3 pytorch recurrent-neural-networks reinforcement-learning

Last synced: 10 Apr 2025

https://github.com/chagmgang/pysc2_rl

a2c advantage-actor-critic deep-learning deep-q-learning ppo proximal-policy-optimization pysc2 pysc2-mini-games reinfrocement-learning

Last synced: 02 Jul 2025

https://github.com/xffxff/endorphin

Like dopamine, but for different algorithms

a2c dopamine ppo pytorch reinforcement-learning

Last synced: 12 Apr 2025

https://github.com/rsgoksel/mechopter

PyGame-based quadcopter simulator & Reinforcement Learning Project

a2c dqn ipynb ppo pygame python quadcopter reinforcement-learning

Last synced: 28 Oct 2025

https://github.com/nashory/rl-tutorial-by-games

Game collection trained by SOTA reinforcement learning algorithms for tutorial

a2c atari deep-learning dqn game pytorch reinforcement-learning tutorial

Last synced: 14 Sep 2025

https://github.com/maik97/wacky-rl

Custom Reinforcement Learning Agents

a2c actor-critic deep-learning deep-reinforcement-learning dqn gym policy-gradient ppo proximal-policy-optimization reinforcement-learning rl-agents rl-algorithms sac soft-actor-critic

Last synced: 21 Sep 2025

https://github.com/weiyx16/rl_toy_example

Deep RL toy example based on gym package with several methods

a2c a3c continuous-mountain-car dqn-tensorflow gym mountain-car qlearning sarsa sarsa-lambda

Last synced: 15 Mar 2025

https://github.com/sophiaas/rlbase

Modular Deep RL infrastructure in PyTorch

a2c deep-reinforcement-learning hierarchical-reinforcement-learning option-critic policy-gradient ppo proximal-policy-optimization pytorch-rl reinforcement-learning

Last synced: 14 May 2026

https://github.com/mind/a2c

The OpenAI advantage actor critic (A2C) model

a2c openai openai-gym reinforcement-learning

Last synced: 08 May 2026

https://github.com/wondervictor/reinforcement.learning

Reinforcement Learning Playground

a2c ddpg deep-reinforcement-learning dqn dueling-network-architecture reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/jasonzhu1313/analytics-zoo-reinforcement-learning

a2c analytics-zoo bigdl cartpole dqn flappy-bird policy-gradient reinforce reinforcement-learning

Last synced: 28 Apr 2026

https://github.com/dylanzenner/stock-rl

This repo contains our project for CPE 800 which focuses on reinforcement learning for stock prediction

a2c alpaca-trading-api aws ddpg ensemble github-actions ppo reinforcement-learning stablebaselines3 td3 terraform terraform-cloud

Last synced: 12 Jan 2026

https://github.com/rainlumostaipei/cuda-qnet-a2c

Qnet and A2C impl in cuda

a2c cuda qnet

Last synced: 26 Jun 2025

https://github.com/peiranli/rl

My RL agent in PyTorch

a2c continuous-versions ddpg policy-gradient ppo reinforcement-learning-algorithms

Last synced: 15 Jul 2025

https://github.com/yunjaechoi/rlagent

Reinforcement learning framework in tensorflow, compatible with OpenAI Gym like environments.

a2c ddpg reinforcement-learning tensorflow

Last synced: 11 May 2026

https://github.com/antoniszks/cartpole-optimization-with-reinforcement-learning

A university project where we implement and experiment with different Reinforcement Learning algorithms and trying to optimize the CartPole environment from OpenAI Gym.

a2c ai ai-agents dqn dueling-network-architecture gymnasium ml openai openai-gym ppo reinforcement-learning stable-baselines3 transformers

Last synced: 30 Apr 2026

https://github.com/ttitcombe/a3c

PyTorch implementation of Asynchronous (and Synchronous) Advantage Actor Critic

a2c a3c advantage-actor-critic pytorch pytorch-implementation reinforcement-learning reinforcement-learning-algorithms

Last synced: 30 Apr 2026

https://github.com/dhyanesh18/flappbird-rl

PPO agent and A2C agents for Flappybird. Includes scripts, training code, and evaluation tools.

a2c flappybird opencv ppo pygame-learning-environment reinforcement-learning stablebaselines3

Last synced: 06 May 2026

https://github.com/dmsovetov/reinforcement

Deep reinforcement learning experiments

a2c a3c deeplearning pytorch qlearning reinforce reinforcement-learning

Last synced: 07 May 2026

https://github.com/ethanmclark1/rl_toolkit

Implementation of core reinforcement learning algorithms with PyTorch

a2c deep-reinforcement-learning deep-rl dueling-dqn openai-gym policy-gradient pytorch q-table reinforcement-learning rl

Last synced: 16 Apr 2026

https://github.com/indutny/haggling_rl

RL learning model for Hola's Haggling Challenge

a2c haggle ppo reinforcement-learning

Last synced: 19 May 2026

https://github.com/starlitdreams/kungfu-a2c

Kung Fu Master A2C Agent is a PyTorch implementation of the Advantage Actor–Critic (A2C) algorithm, engineered to learn and play the Atari classic Kung Fu Master. It uses Gymnasium (with ALE ROMs) to manage multiple parallel environments and a custom preprocessing wrapper.

a2c a2c-agent actor-critic agentic-ai atari convolutional-neural-networks deep-reinforcement-learning gymnasium image-preprocessing kungfumaster python python3 pytorch reinforcement-learning

Last synced: 17 Apr 2026

https://github.com/mdraihansobhan/strategic-stock-trading-with-deep-reinforcement-learning-models

a2c ddpg deep-reinforcement-learning mvo ppo sac stock-price-prediction td3

Last synced: 14 Mar 2025