Projects in Awesome Lists tagged with ppo-pytorch

https://github.com/nikhilbarhate99/ppo-pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 15 May 2025

https://github.com/nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 29 Apr 2025

https://github.com/taherfattahi/ppo-rocket-landing

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

ai machine-learning ppo ppo-pytorch pytorch reinforcement-learning

Last synced: 04 Apr 2025

https://github.com/solrikk/criptowhisper

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

aitrade bybit bybit-api bybit-bot criptotrading finance ppo-pytorch python pytorch stable-baselines3 trade-bot trading trading-algorithms tradingapi

Last synced: 11 Apr 2025

https://github.com/paulchen2713/ris-miso-hwi-drl

Worst-case MSE Minimization for RIS-assisted mmWave MU-MISO Systems with Hardware Impairments and CSI Imperfection

digital-beamforming gymnasium ppo-pytorch reconfigurable-intelligent-surfaces reinforcement-learning stable-baselines3 wireless-communication

Last synced: 22 Nov 2024

https://github.com/imoneoi/xrl-ppo

Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving

automated-machine-learning autonomous-driving autonomous-vehicles deep-learning deep-reinforcement-learning ppo ppo-pytorch pytorch reinforcement-learning

Last synced: 16 Dec 2024

https://github.com/solrikk/tradewhisper

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

aitrade bybit bybit-api bybit-bot criptotrading finance ppo-pytorch python pytorch stable-baselines3 trade-bot trading trading-algorithms tradingapi

Last synced: 08 Dec 2024

https://github.com/datarohit/ai-mario-game

This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.

deep-learning mario-game ppo-pytorch python tensorboard

Last synced: 09 Apr 2025

https://github.com/naidezhujimo/proximal-policy-optimization-ppo-for-bipedalwalker-v3

his repository contains an implementation of the Proximal Policy Optimization (PPO) algorithm to solve the BipedalWalker-v3 environment from the Gymnasium library. This project uses a combination of policy and value networks to learn a policy for controlling a bipedal walker.

deep-learning gae gymnasium ppo-pytorch pytorch rl

Last synced: 11 Mar 2025

https://github.com/achronus/rl_atari_games

An exploration of the effects of Intrinsic Motivation methods on RL algorithms using Atari games.

curiosity deep-reinforcement-learning dqn-pytorch empowerment intrinsic-motivation ppo-pytorch pytorch rainbow-dqn reinforcement-learning

Last synced: 29 Mar 2025

https://github.com/nonkloq/mazeharvest

A Grid Based RL Environment & Implementaions of few Deep-RL Algorithms.

dqn-pytorch ppo-pytorch pytorch reinforcement-learning rl-algorithms-pytorch rl-environment

Last synced: 05 Mar 2025

https://github.com/ruvenguna94/dialogue-summary-remove-toxic-text-ppo

Fine-tuning FLAN-T5 with PPO and PEFT to generate less toxic text summaries. This notebook leverages Meta AI's hate speech reward model and utilizes RLHF techniques for improved safety.

detoxification dialogue-summarization generative-ai hate-speech-detection nlp ppo-pytorch reward-model toxic-comment-classification toxicity-analysis

Last synced: 06 Apr 2025

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome