An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with a2c

A curated list of projects in awesome lists tagged with a2c.

https://github.com/thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

a2c atari bcq cql ddpg double-dqn dqn drl imitation-learning mujoco npg policy-gradient ppo pytorch rl sac td3 transferlab trpo

Last synced: 13 May 2025

https://github.com/vwxyzjn/cleanrl

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

a2c actor-critic advantage-actor-critic ale atari deep-learning deep-reinforcement-learning gym machine-learning phasic-policy-gradient ppo proximal-policy-optimization python pytorch reinforcement-learning wandb

Last synced: 14 May 2025

https://github.com/andri27-ts/reinforcement-learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning

Last synced: 15 May 2025

https://github.com/andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning

Last synced: 15 Mar 2025

https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

a2c acktr actor-critic advantage-actor-critic ale atari continuous-control deep-learning deep-reinforcement-learning hessian kfac kronecker-factored-approximation mujoco natural-gradients ppo proximal-policy-optimization pytorch reinforcement-learning roboschool second-order

Last synced: 13 Apr 2025

https://github.com/seungeunrho/minimalrl

Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)

a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple

Last synced: 15 May 2025

https://github.com/seungeunrho/minimalRL

Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)

a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple

Last synced: 03 Apr 2025

https://github.com/kengz/slm-lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 11 Feb 2026

https://github.com/kengz/SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 01 Apr 2025

https://github.com/khrylx/pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo

Last synced: 12 Apr 2025

https://github.com/Khrylx/PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo

Last synced: 29 Apr 2025

https://github.com/lcswillems/rl-starter-files

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

a2c a3c minigrid multi-process ppo preprocessed-observations pytorch reward-shaping

Last synced: 26 Oct 2025

https://github.com/TianhongDai/reinforcement-learning-algorithms

This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)

a2c actor-critic algorithm atari2600 ddpg deep-learning deep-reinforcement-learning dqn dueling-dqn flappy-bird ppo proximal-policy-optimization pytorch sac soft-actor-critic trpo trust-region-policy-optimization

Last synced: 19 Jul 2025

https://github.com/archsyscall/deeprl-tensorflow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

a2c a3c ddpg deep-learning deep-reinforcement-learning double-dqn dqn dueling-dqn machine-learning ppo rainbow-dqn reinforce reinforcement-learning sac tensorflow tensorflow2 trpo

Last synced: 05 Apr 2025

https://github.com/archsyscall/DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

a2c a3c ddpg deep-learning deep-reinforcement-learning double-dqn dqn dueling-dqn machine-learning ppo rainbow-dqn reinforce reinforcement-learning sac tensorflow tensorflow2 trpo

Last synced: 15 Oct 2025

https://github.com/dongminlee94/deep_rl

PyTorch implementation of deep reinforcement learning algorithms

a2c ddpg ddqn deep-reinforcement-learning dqn model-free-rl npg ppo pytorch sac sac-aea td3 trpo vpg

Last synced: 05 Apr 2025

https://github.com/inoryy/tensorflow2-deep-reinforcement-learning

Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"

a2c advantage-actor-critic deep-reinforcement-learning keras tensorflow tensorflow2

Last synced: 08 May 2025

https://github.com/tobiasfshr/deep-reinforcement-learning-drone-control

A drone control system based on deep reinforcement learning with Tensorflow and ROS

a2c deep-reinforcement-learning drone-controller gazebo ros

Last synced: 03 Jan 2026

https://github.com/godka/pensieve-ppo

The simplest implementation of Pensieve (SIGCOMM '17) using state-of-the-art RL algorithms, including PPO, DQN, and SAC, with support for both TensorFlow and PyTorch.

a2c deep-learning dqn pensieve ppo pytorch reinforcement-learning tensorflow

Last synced: 07 Apr 2025

https://github.com/jianzhnie/rltoolkit

RLToolkit is a flexible and highly efficient reinforcement learning framework. It includes implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ...

a2c actor-critic ddpg ddqn dqn maddpg mappo ppo qmix rl sac td3 trpo

Last synced: 03 Aug 2025

https://github.com/xffxff/endorphin

Like dopamine, but for different algorithms

a2c dopamine ppo pytorch reinforcement-learning

Last synced: 12 Apr 2025

https://github.com/rsgoksel/mechopter

PyGame-based quadcopter simulator & Reinforcement Learning Project

a2c dqn ipynb ppo pygame python quadcopter reinforcement-learning

Last synced: 28 Oct 2025

https://github.com/nashory/rl-tutorial-by-games

A collection of games played by agents trained with SOTA reinforcement learning algorithms, intended as a tutorial

a2c atari deep-learning dqn game pytorch reinforcement-learning tutorial

Last synced: 14 Sep 2025

https://github.com/weiyx16/rl_toy_example

Deep RL toy example based on gym package with several methods

a2c a3c continuous-mountain-car dqn-tensorflow gym mountain-car qlearning sarsa sarsa-lambda

Last synced: 15 Mar 2025

https://github.com/mind/a2c

The OpenAI advantage actor critic (A2C) model

a2c openai openai-gym reinforcement-learning

Last synced: 08 May 2026

https://github.com/dylanzenner/stock-rl

This repo contains our project for CPE 800, which focuses on reinforcement learning for stock prediction

a2c alpaca-trading-api aws ddpg ensemble github-actions ppo reinforcement-learning stablebaselines3 td3 terraform terraform-cloud

Last synced: 12 Jan 2026

https://github.com/rainlumostaipei/cuda-qnet-a2c

Qnet and A2C implementations in CUDA

a2c cuda qnet

Last synced: 26 Jun 2025

https://github.com/yunjaechoi/rlagent

Reinforcement learning framework in TensorFlow, compatible with OpenAI Gym-like environments.

a2c ddpg reinforcement-learning tensorflow

Last synced: 11 May 2026

https://github.com/antoniszks/cartpole-optimization-with-reinforcement-learning

A university project in which we implement and experiment with different reinforcement learning algorithms and try to optimize performance on the CartPole environment from OpenAI Gym.

a2c ai ai-agents dqn dueling-network-architecture gymnasium ml openai openai-gym ppo reinforcement-learning stable-baselines3 transformers

Last synced: 30 Apr 2026

https://github.com/ttitcombe/a3c

PyTorch implementation of Asynchronous (and Synchronous) Advantage Actor Critic

a2c a3c advantage-actor-critic pytorch pytorch-implementation reinforcement-learning reinforcement-learning-algorithms

Last synced: 30 Apr 2026

https://github.com/dhyanesh18/flappbird-rl

PPO and A2C agents for Flappy Bird. Includes scripts, training code, and evaluation tools.

a2c flappybird opencv ppo pygame-learning-environment reinforcement-learning stablebaselines3

Last synced: 06 May 2026

https://github.com/dmsovetov/reinforcement

Deep reinforcement learning experiments

a2c a3c deeplearning pytorch qlearning reinforce reinforcement-learning

Last synced: 07 May 2026

https://github.com/ethanmclark1/rl_toolkit

Implementation of core reinforcement learning algorithms with PyTorch

a2c deep-reinforcement-learning deep-rl dueling-dqn openai-gym policy-gradient pytorch q-table reinforcement-learning rl

Last synced: 16 Apr 2026

https://github.com/indutny/haggling_rl

RL learning model for Hola's Haggling Challenge

a2c haggle ppo reinforcement-learning

Last synced: 19 May 2026

https://github.com/starlitdreams/kungfu-a2c

Kung Fu Master A2C Agent is a PyTorch implementation of the Advantage Actor–Critic (A2C) algorithm, engineered to learn and play the Atari classic Kung Fu Master. It uses Gymnasium (with ALE ROMs) to manage multiple parallel environments and a custom preprocessing wrapper.

a2c a2c-agent actor-critic agentic-ai atari convolutional-neural-networks deep-reinforcement-learning gymnasium image-preprocessing kungfumaster python python3 pytorch reinforcement-learning

Last synced: 17 Apr 2026