Projects in Awesome Lists tagged with trust-region-policy-optimization
A curated list of projects in awesome lists tagged with trust-region-policy-optimization .
https://github.com/TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
a2c actor-critic algorithm atari2600 ddpg deep-learning deep-reinforcement-learning dqn dueling-dqn flappy-bird ppo proximal-policy-optimization pytorch sac soft-actor-critic trpo trust-region-policy-optimization
Last synced: 27 Nov 2024
https://github.com/ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
continuous-control deep-learning deep-reinforcement-learning mujoco pytorch reinforcement-learning trpo trust-region-policy-optimization
Last synced: 06 Apr 2025
https://github.com/funnydman/bfgs-neldermead-trustregion
Python implementation of some numerical (optimization) methods
ai bfgs dogleg-algorithm dogleg-method machine-learning machine-learning-algorithms mathematics nelder-mead numerical-methods numerical-optimization optimization python trust-region trust-region-dogleg-algorithm trust-region-policy-optimization
Last synced: 21 Mar 2025
https://github.com/hcnoh/rl-collection-pytorch
A collection of Reinforcement Learning implementations with PyTorch
actor-critic continuous-control deep-learning deep-reinforcement-learning gae generalized-advantage-estimation openai-gym policy-gradient ppo proximal-policy-optimization pytorch reinforcement-learning trpo trust-region-policy-optimization
Last synced: 30 Apr 2025
https://github.com/lihangliu/cs395t-numerical-optimization
Course projects of CS395T Numerical Optimization, UT Austin
optimization proximal-policy-optimization trust-region-policy-optimization
Last synced: 02 Mar 2025
https://github.com/legalaspro/rl-odyssey
RL-Odyssey is a research framework for continuous control that implements state-of-the-art RL algorithms (SAC, TD3, PPO, etc.) with clean experiment scripts and interactive notebooks.
a3c actor-critic continuous-control ddpg deep-learning deep-reinforcement-learning dm-control gymnasium mujoco policy-gradient ppo proximal-policy-optimization pytorch reinforcement-learning sac soft-actor-critic td3 trpo trust-region-policy-optimization
Last synced: 04 Mar 2025