Projects in Awesome Lists tagged with ppo-algorithm
A curated list of projects in awesome lists tagged with ppo-algorithm .
https://github.com/vachanvy/reinforcement-learning
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.
actor-critic-algorithm actor-critic-pytorch artificial-intelligence ddpg-algorithm deep-deterministic-policy-gradient deep-reinforcement-learning dqn dqn-pytorch policy-gradient policy-gradient-with-baseline ppo-algorithm proximal-policy-optimization pytorch reinforcement-learning reinforcement-learning-an-introduction rl-book soft-actor-critic-continuous sutton-barto-book
Last synced: 09 Apr 2025
https://github.com/mohammadzainabbas/reinforcement-learning-cs
💡 Grasp - Pick-and-place with a robotic hand 👨🏻💻
brax gym-environment mamba model-free-rl physics-engine ppo ppo-agent ppo-algorithm python pytorch reinforcement-learning sac
Last synced: 15 Jul 2025
https://github.com/negarhonarvar/deepreinforcementlearning
A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments
boltzmann-exploration cartpole-v1 d3qn dqn drl-algorithms gymnasium-environment lunar-lander ppo-algorithm sarsa softmax-exploration swimmer
Last synced: 23 Jul 2025
https://github.com/sayeang/aws-deepracer-autonomous-racing-model
Developed-an-AWS-DeepRacer-model-using-Python-&-the-PPO-algorithm,-leveraging-TensorFlow-to-train-&-fine-tune-a-deep-reinforcement-learning-model.-Designed-a-custom-reward-function-&-optimized-hyperparameters-to-improve-policy-learning-&-navigation-performance.-Utilized-AWS-infrastructure-for-scalable-training-&-deployment.
aws aws-deepracer aws-infrastructure deep-learning deepracer deployment hyperparameter-tuning machine-learning ppo-algorithm python rl scalable tensorflow training
Last synced: 26 Jun 2025
https://github.com/paul7513/aws-deepracer-autonomous-racing-model
el using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.
aws aws-deepracer aws-infrastructure deep-learning deepracer deployment hyperparameter-tuning machine-learning ppo-algorithm python rl scalable tensorflow training
Last synced: 29 Mar 2025
https://github.com/fiscalcareerweek/aws-deepracer-autonomous-racing-model
Developed-an-AWS-DeepRacer-model-using-Python-&-the-PPO-algorithm,-leveraging-TensorFlow-to-train-&-fine-tune-a-deep-reinforcement-learning-model.-Designed-a-custom-reward-function-&-optimized-hyperparameters-to-improve-policy-learning-&-navigation-performance.-Utilized-AWS-infrastructure-for-scalable-training-&-deployment.
aws aws-deepracer aws-infrastructure deep-learning deepracer deployment hyperparameter-tuning machine-learning ppo-algorithm python rl scalable tensorflow training
Last synced: 26 Jul 2025
https://github.com/gummala-vinaykumar/aws-deepracer-autonomous-racing-model
Developed an AWS DeepRacer model using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.
aws aws-deepracer aws-infrastructure deep-learning deepracer deployment hyperparameter-tuning machine-learning ppo-algorithm python rl scalable tensorflow training
Last synced: 30 Jul 2025
https://github.com/ashioyajotham/weather_forecasting_lora
How close can LoRA get to full fine-tuning (FullFT) in terms of learning speed, performance, and compute tradeoffs? And under what conditions?
finetuning lora-fine-tuning mistral-7b ppo-algorithm rlhf thinking-machines weather-forecasting
Last synced: 17 Oct 2025
https://github.com/haimanm3/aws-deepracer-autonomous-racing-model
Developed an AWS DeepRacer model using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.
aws-infrastructure deep-learning deployment hyperparameter-tuning machine-learning ppo-algorithm python scalable tensorflow training
Last synced: 26 Oct 2025
https://github.com/sanatren/legal-document-analyzer
This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.
bart byte-pair-encoding deep-learning finetuning-transformers huggingface ppo-algorithm reinforcement-learning-algorithms transformer
Last synced: 13 Jul 2025
https://github.com/yusufu790/aws-deepracer-autonomous-racing-model
el using Python & the PPO algorithm, leveraging TensorFlow to train & fine-tune a deep reinforcement learning model. Designed a custom reward function & optimized hyperparameters to improve policy learning & navigation performance. Utilized AWS infrastructure for scalable training & deployment.
aws aws-deepracer aws-infrastructure deep-learning deepracer deployment hyperparameter-tuning machine-learning ppo-algorithm python rl scalable tensorflow training
Last synced: 29 Mar 2025