An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with actor-critic-algorithm

A curated list of projects in awesome lists tagged with actor-critic-algorithm.

https://github.com/BY571/Soft-Actor-Critic-and-Extensions

PyTorch implementation of Soft Actor-Critic (SAC) with Prioritized Experience Replay (PER), Emphasizing Recent Experience (ERE), Munchausen RL, D2RL, and parallel environments.

actor-critic-algorithm continuous d2rl emphasizing-recent-experience multi-environment munchausen munchausen-reinforcement-learning parallel-computing prioritized-experience-replay pytorch reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic

Last synced: 19 Jul 2025

https://github.com/rsgoksel/reinforcement-learning-ponggame

Reinforcement learning: a PPO (Proximal Policy Optimization) implementation for the game Pong.

actor-critic-algorithm pong-game ppo proximal-policy-optimization python reinforcement-learning

Last synced: 21 Jul 2025

https://github.com/shaheennabi/reinforcement-or-deep-reinforcement-learning-practices-and-mini-projects

A hands-on guide to implementing reinforcement learning (RL) algorithms, from Markov Decision Processes (MDPs) to advanced methods like PPO and DDPG. Build agents, learn the math behind policies, and experiment with real-world applications.

actor-critic-algorithm agent markov-decision-processes model-based-rl model-free-rl monte-carlo policy-gradient policy-optimization proximal-policy-optimization reinforcement-learning research temporal-differencing-learning

Last synced: 11 Oct 2025

https://github.com/nima-siboni/simplest-world-actor-critic

Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world

actor-critic-algorithm monte-carlo-simulation on-policy reinforcement-learning reinforcement-learning-environments

Last synced: 08 Oct 2025

https://github.com/d-dawg78/mva_rl

Master MVA - Reinforcement Learning Project

actor-critic-algorithm echolocation td-learning

Last synced: 04 Apr 2025