Projects in Awesome Lists tagged with reinforcement-learning-algorithms

https://github.com/dlr-rm/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

baselines gsde gym machine-learning openai python pytorch reinforcement-learning reinforcement-learning-algorithms robotics sb3 sde stable-baselines toolbox

Last synced: 12 May 2025

https://github.com/DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

baselines gsde gym machine-learning openai python pytorch reinforcement-learning reinforcement-learning-algorithms robotics sb3 sde stable-baselines toolbox

Last synced: 26 Mar 2025

https://github.com/udacity/deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

cross-entropy ddpg deep-reinforcement-learning dqn dynamic-programming hill-climbing ml-agents neural-networks openai-gym openai-gym-solutions ppo pytorch pytorch-rl reinforcement-learning reinforcement-learning-algorithms rl-algorithms

Last synced: 27 Nov 2024

https://github.com/hill-a/stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox

Last synced: 26 Mar 2025

https://github.com/opendilab/di-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac

Last synced: 12 May 2025

https://github.com/opendilab/DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac

Last synced: 01 Apr 2025

https://github.com/nikhilbarhate99/ppo-pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 15 May 2025

https://github.com/nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 29 Apr 2025

https://github.com/nicrusso7/rex-gym

OpenAI Gym environments for an open-source quadruped robot (SpotMicro)

artificial-intelligence gym-environment inverse-kinematics legged-robots machine-learning openai openai-gym openai-gym-environments pybullet python3 quadruped quadruped-robot quadruped-robot-gaits reinforcement-learning reinforcement-learning-algorithms robot robotic-arm robotics spotmicro tensorflow

Last synced: 08 Apr 2025

https://github.com/neuromatchacademy/course-content-dl

NMA deep learning course

continual-learning convolutional-neural-networks deep-learning recurrent-neural-networks reinforcement-learning-algorithms transformers

Last synced: 14 May 2025

https://github.com/NeuromatchAcademy/course-content-dl

NMA deep learning course

continual-learning convolutional-neural-networks deep-learning recurrent-neural-networks reinforcement-learning-algorithms transformers

Last synced: 08 May 2025

https://github.com/JuliaPOMDP/POMDPs.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

artificial-intelligence control-systems julia markov-decision-processes mdps pomdps python reinforcement-learning reinforcement-learning-algorithms

Last synced: 09 May 2025

https://github.com/juliapomdp/pomdps.jl

MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.

artificial-intelligence control-systems julia markov-decision-processes mdps pomdps python reinforcement-learning reinforcement-learning-algorithms

Last synced: 14 May 2025

https://github.com/luchris429/purejaxrl

Really Fast End-to-End Jax RL Implementations

deep-reinforcement-learning jax ppo reinforcement-learning reinforcement-learning-algorithms

Last synced: 20 Mar 2025

https://github.com/cpnota/autonomous-learning-library

A PyTorch library for building deep reinforcement learning agents.

a2c advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-learning deep-reinforcement-learning dqn dqn-pytorch ppo proximal-policy-optimization reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic

Last synced: 01 Apr 2025

https://github.com/skylark0924/rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

embodied-ai forward-kinematics humanoid humanoid-robots imitation-learning inverse-kinematics isaac-gym isaac-sim learning-from-demonstration manipulability optitrack planning-algorithms reinforcement-learning-algorithms robot robot-control robot-learning robot-manipulation robot-planning

Last synced: 14 May 2025

https://github.com/cbfinn/gps

Guided Policy Search

deep-learning deep-reinforcement-learning reinforcement-learning reinforcement-learning-algorithms robotics

Last synced: 04 Apr 2025

https://github.com/stable-baselines-team/stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

experimental gsde gym machine-learning openai pytorch reinforcement-learning reinforcement-learning-algorithms research rl robotics sde stable-baselines

Last synced: 14 May 2025

https://github.com/Skylark0924/Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

embodied-ai forward-kinematics humanoid humanoid-robots imitation-learning inverse-kinematics isaac-gym isaac-sim learning-from-demonstration manipulability optitrack planning-algorithms reinforcement-learning-algorithms robot robot-control robot-learning robot-manipulation robot-planning

Last synced: 02 Apr 2025

https://github.com/mishalaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac

Last synced: 05 Apr 2025

https://github.com/MishaLaskin/curl

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac

Last synced: 23 Nov 2024

https://github.com/Omegastick/pytorch-cpp-rl

PyTorch C++ Reinforcement Learning

a2c actor-critic advantage-actor-critic continuous-control cplusplus cpp libtorch ppo proximal-policy-optimization pytorch pytorch-cpp-frontend pytorch-rl reinforcement-learning reinforcement-learning-algorithms

Last synced: 07 May 2025

https://github.com/qiwihui/reinforcement-learning-an-introduction-chinese

《Reinforcement Learning: An Introduction》（第二版）中文翻译

reinforcement-learning reinforcement-learning-algorithms sphinx-doc

Last synced: 05 Apr 2025

https://github.com/EricSteinberger/PokerRL

Framework for Multi-Agent Deep Reinforcement Learning in Poker

deep-learning framework gym-environment poker ray reinforcement-learning reinforcement-learning-algorithms research

Last synced: 27 Mar 2025

https://github.com/sforaidl/genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL

algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms

Last synced: 04 Apr 2025

https://github.com/SforAiDl/genrl

A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL

algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms

Last synced: 01 May 2025

https://github.com/pku-alignment/safe-policy-optimization

NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms

benchmarks constrained-reinforcement-learning reinforcement-learning-algorithms safe safe-reinforcement-learning

Last synced: 07 May 2025

https://github.com/huawei-noah/xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

dqn impala muzero ppo qmix reinforcement-learning-algorithms

Last synced: 05 Apr 2025

https://github.com/nikhilbarhate99/hierarchical-actor-critic-hac-pytorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

actor-critic deep-reinforcement-learning gym-environment gym-environments hierarchical-reinforcement-learning openai-gym pytorch pytorch-implementation pytorch-rl reinforcement-learning reinforcement-learning-algorithms

Last synced: 09 Apr 2025

https://github.com/nikhilbarhate99/Hierarchical-Actor-Critic-HAC-PyTorch

PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments

actor-critic deep-reinforcement-learning gym-environment gym-environments hierarchical-reinforcement-learning openai-gym pytorch pytorch-implementation pytorch-rl reinforcement-learning reinforcement-learning-algorithms

Last synced: 10 May 2025

https://github.com/opendilab/di-engine-docs

DI-engine docs (Chinese and English)

deep-learning imitation-learning inverse-reinforcement-learning model-based-reinforcement-learning multi-agent-reinforcement-learning offline-rl pytorch-rl reinforcement-learning reinforcement-learning-algorithms

Last synced: 06 Apr 2025

https://github.com/jxareas/machine-learning-notebooks

The full collection of Jupyter Notebook labs from Andrew Ng's Machine Learning Specialization.

clustering deep-learning jupyter-notebook kmeans learn linear-regression logistic-regression machine-learning machine-learning-algorithms neural-network numpy python regression reinforcement-learning reinforcement-learning-algorithms supervised-learning tensorflow unsupervised-learning

Last synced: 16 May 2025

https://github.com/imagry/aleph_star

Reinforcement learning with A* and a deep heuristic

control-systems dqn machine-learning-algorithms optimization-algorithms reinforcement-learning reinforcement-learning-algorithms shortest-path

Last synced: 27 Nov 2024

https://github.com/aurimas13/machine-learning-goodness

The Machine Learning project including ML/DL projects, notebooks, cheat codes of ML/DL, useful information on AI/AGI and codes or snippets/scripts/tasks with tips.

algorithms artifcial-intelligence artificial-intelligence chatgpt cheatsheets computer-science data-science deep-neural-networks deep-reinforcement-learning gpt4 machine-learning machine-learning-algorithms mlops python python3 reinforcement-learning reinforcement-learning-algorithms tips tips-and-tricks

Last synced: 21 Apr 2025

https://github.com/BY571/Soft-Actor-Critic-and-Extensions

PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.

actor-critic-algorithm continuous d2rl emphasizing-recent-experience multi-environment munchausen munchausen-reinforcement-learning parallel-computing prioritized-experience-replay pytorch reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic

Last synced: 27 Nov 2024

https://github.com/bentrevett/pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

a2c actor-critic advantage-actor-critic generalized-advantage-estimation policy-gradient pytorch pytorch-implementation pytorch-implmention pytorch-rl pytorch-tutorial pytorch-tutorials reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 27 Mar 2025

https://github.com/binary-husky/hmp2g

Multiagent Reinforcement Learning Research Project

machine-learning reinforcement-learning-algorithms simulation

Last synced: 04 Apr 2025

https://github.com/nasdin/reinforcementlearning-atarigame

Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantage Actor-Critic (A3C) Algorithm. This is much superior and efficient than DQN and obsoletes it. Can play on many games

a3c a3c-lstm actor-critic adam asynchronous-advantage-actor-critic deep-reinforcement-learning lstm openai-gym python pytorch reinforcement-agents reinforcement-learning reinforcement-learning-algorithms rmsprop universe

Last synced: 10 Apr 2025

https://github.com/coax-dev/coax

Modular framework for Reinforcement Learning in python

reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms

Last synced: 13 May 2025

https://github.com/kkuette/TradzQAI

Trading environnement for RL agents, backtesting and training.

algorithm backtesting bitcoin bitcoin-bot reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms trading trading-algorithms trading-bot trading-env

Last synced: 24 Mar 2025

https://github.com/gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

deep-learning deep-q-network dqn jupyter policy-gradient ppo python pytorch pytorch-dqn pytorch-implementation pytorch-policy-gradient pytorch-ppo reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 26 Apr 2025

https://github.com/xiangwang1223/kgpolicy

Reinforced Negative Sampling over Knowledge Graph for Recommendation, WWW2020

explainable-recommendation knowledge-aware-recommendation knowledge-based-recommendation knowledge-graph knowledge-graph-dataset knowledge-graph-for-recommendation negative-sampling recommender-system reinforcement-learning reinforcement-learning-algorithms www2020

Last synced: 20 Dec 2024

https://github.com/BY571/DQN-Atari-Agents

DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN

atari c51 ddqn-pyotrch deep-reinforcement-learning dqn-pytorch drqn dueling-dqn-pytorch multi-environment multi-step-dqn multiprocessing n-step-dqn noisy-dqn openai parallel-computing prioritized-experience-replay rainbow reinforcement-learning-agent reinforcement-learning-algorithms

Last synced: 05 May 2025

https://github.com/activatedgeek/torchrl

Highly Modular and Scalable Reinforcement Learning

deep-learning deep-reinforcement-learning dqn machine-learning policy-gradient python3 pytorch reinforcement-learning reinforcement-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/XinJingHao/TD3-BipedalWalkerHardcore-v2

Solve BipedalWalkerHardcore-v2 with TD3

reinforcement-learning-algorithms robot

Last synced: 28 Nov 2024

https://github.com/yaricom/goneat

The GOLang implementation of NeuroEvolution of Augmented Topologies (NEAT) method to evolve and train Artificial Neural Networks without error back propagation

artificial-neural-networks augmenting-topologies neat neural-network neuroevolution reinforcement-learning reinforcement-learning-algorithms unsupervised-learning unsupervised-machine-learning

Last synced: 06 Apr 2025

https://github.com/namoshizun/pypomdp

Python implementation of POMDP framework and PBVI & POMCP algorithms.

educational pomdp python reinforcement-learning-algorithms

Last synced: 21 Mar 2025

https://github.com/nikhilbarhate99/actor-critic-pytorch

Policy Gradient Actor-Critic PyTorch | Lunar Lander v2

a2c actor-critic deep-reinforcement-learning openai-gym openai-gym-environments policy-gradient pytorch pytorch-implmention pytorch-tutorial reinforcement-learning-algorithms

Last synced: 04 May 2025

https://github.com/diditforlulz273/PokerRL-Omaha

Omaha Poker functionality+some features for PokerRL Reinforcement Learning card framwork

cfr counterfactual-regret-minimization deep-learning monte-carlo-tree-search omaha-poker poker-bot pytorch reinforcement-learning reinforcement-learning-algorithms

Last synced: 27 Mar 2025

https://github.com/ugurkanates/spacexreinforcementlearning

SpaceX Falcon 9 simulated with Reinforcement Learning algorithms such as D4PG,SAC and PPO.

reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments rocket spacex

Last synced: 08 Feb 2025

https://github.com/ikostrikov/pytorch-rl

pytorch reinforcement-learning reinforcement-learning-algorithms

Last synced: 30 Apr 2025

https://github.com/scitator/papers

arxiv deep-learning deep-reinforcement-learning papers reinforcement-learning reinforcement-learning-algorithms

Last synced: 29 Mar 2025

https://github.com/yaricom/goneat_ns

This project provides GOLang implementation of Neuro-Evolution of Augmenting Topologies (NEAT) with Novelty Search optimization aimed to solve deceptive tasks with strong local optima

artificial-neural-networks augmenting-topologies explainable-ai explainable-artificial-intelligence golang modular-ai neat neuroevolution novelty-search reinforcement-learning-algorithms unsupervised-learning unsupervised-learning-algorithms unsupervised-machine-learning

Last synced: 05 Apr 2025

https://github.com/xuehaipan/mate

MATE: the Multi-Agent Tracking Environment.

multi-agent-reinforcement-learning openai-gym openai-gym-environment reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environment

Last synced: 01 May 2025

https://github.com/mazzzystar/qlearningmouse

Cat-and-Mouse game with Reinforcement Learning (Q-Learning).

qlearning-algorithm reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-excercises

Last synced: 28 Apr 2025

https://github.com/diovisgood/intraday

Gym environment which simulates intraday trading

candle candlestick candlestick-chart environment gym gym-environment intraday reinforcement-learning reinforcement-learning-algorithms stream trades trading trading-bot trading-strategies

Last synced: 10 Apr 2025

https://github.com/baskuit/r-nad

Experimentation with Regularized Nash Dynamics on a GPU accelerated game

deepnash multiagent-reinforcement-learning pytorch reinforcement-learning reinforcement-learning-algorithms rnad

Last synced: 30 Jan 2025

https://github.com/mimoralea/king-pong

Deep Reinforcement Learning Pong Agent, King Pong, he's the best

agent deep-learning deep-q-network deep-reinforcement-learning dqn king-pong machine-learning percept q-learning reinforcement-learning reinforcement-learning-algorithms

Last synced: 15 Apr 2025

https://github.com/jason-cky/deeprl-pytorch

Pytorch implementations of various Deep Reinforcement Learning algorithms on pybullet environments.

ddpg ppo pybullet-environments python3 pytorch-implementation reinforcement-learning-algorithms rlbench td3 trpo

Last synced: 20 Nov 2024

https://github.com/v-i-s-h/mab.jl

A Julia Package for providing Multi Armed Bandit Experiments

bandit-experiments exp julia julia-language julia-package julialang mab multi-arm-bandits reinforcement-learning reinforcement-learning-algorithms thompson-sampling ucb

Last synced: 01 May 2025

https://github.com/sadrasabouri/pyrandwalk

:walking:Python Library for Random Walks

education educational markov-chain networkx probabilistic-graphical-models probability python random-walk reinforcement-learning reinforcement-learning-algorithms simulation stochastic-processes

Last synced: 16 Mar 2025

https://github.com/seungjaeryanlee/rl-exploration

Reinforcement Learning papers on exploration methods.

exploration papers reinforcement-learning reinforcement-learning-algorithms research

Last synced: 25 Jan 2025

https://github.com/gibbsbravo/paraphrasee

Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis

deep-learning deep-reinforcement-learning natural-language-generation natural-language-processing paraphrase-detection paraphrase-generation paraphrase-identification reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms reinforcement-learning-environments

Last synced: 12 Apr 2025

https://github.com/ondrejbiza/bandits

Comparison of bandit algorithms from the Reinforcement Learning bible.

machine-learning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sutton-book

Last synced: 26 Apr 2025

https://github.com/ondrejbiza/racetrack

An environment for tabular Reinforcement Learning agents.

machine-learning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sutton-book

Last synced: 26 Apr 2025

https://github.com/epignatelli/discovering-reinforcement-learning-algorithms

A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.

actor-critic general-meta-learning jax lstm meta-learning paper-implementations paper-with-code policy-gradient reinforcement-learning reinforcement-learning-algorithms rnn stax

Last synced: 02 Mar 2025

https://github.com/xuehaipan/soft-actor-critic

PyTorch Implementation of Soft Actor-Critic Algorithm

actor-critic actor-critic-algorithm reinforcement-learning-algorithms soft-actor-critic

Last synced: 07 May 2025

https://github.com/ajaysub110/rlin200lines

PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines

deep-reinforcement-learning dqn machine-learning policy-gradient ppo pytorch-implementations reinforcement-learning reinforcement-learning-algorithms soft-actor-critic

Last synced: 05 Dec 2024

https://github.com/pockerman/qubic_engine

Collection of C++ based algorithms on numerics, statistics, control, reinforcement learning, machine learning and robotics

control cpp cpp17 extended-kalman-filter filtering finite-element-method finite-volume-method gradient-descent kalman-filter machine-learning numerics physics-simulation reinforcement-learning-algorithms robotics statistics

Last synced: 12 Apr 2025

https://github.com/epignatelli/helx

Interoperating between (Deep) Reiforcement Learning libraries

deep-learning flax jax reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments rl rl-environments

Last synced: 02 Mar 2025

https://github.com/eantcal/nunn

Collection of Machine Learning Algorithms

cplusplus-17 deep-neural-networks handwritten-digits linux machine-learning machine-learning-algorithms macos mnist modern-cpp multilayer-perceptron multilayer-perceptron-network neural-network ocr-test qlearning qlearning-algorithm reinforcement-learning-algorithms sarsa tictactoe windows xor-problem

Last synced: 20 Nov 2024

https://github.com/baggepinnen/deterministicpolicygradient.jl

Reinforcement learning with Deterministic Policy Gradient methods

algorithm reinforcement-learning reinforcement-learning-algorithms

Last synced: 15 Mar 2025

https://github.com/zrr1999/reinforcement-learning-with-pytorch

莫烦强化学习教程的PyTorch实现

deep-learning pytorch reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 11 Apr 2025

https://github.com/cyrildever/reinforcement-learning-in-golang

Code for the algorithms of the "Reinforcement Learning" book

golang machine-learning reinforcement-learning-algorithms

Last synced: 10 Apr 2025

https://github.com/wjaskowski/mastering-2048

An efficient reinforcement learning algorithm for learning a strategy for game 2048

2048 ntuples reinforcement-learning reinforcement-learning-algorithms td-learning

Last synced: 12 Apr 2025

https://github.com/epignatelli/reinforcement-learning-an-introduction

A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.

dynamic-programming papers-with-code reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-excercises reinforcement-learning-tutorials sutton-barto-book

Last synced: 02 Mar 2025

https://github.com/kyegomez/hindsightreplay

My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"

artificial-intelligence machine-learning reinforcement-learning-algorithms reinfrocement-learning

Last synced: 07 May 2025

https://github.com/chudleyj/rl.cpp

Reinforcement Learning for stocks in C++

c-plus-plus cpp cpp11 machine-learning machine-learning-algorithms reinforcement-learning reinforcement-learning-algorithms stock-data stock-price-prediction stock-prices

Last synced: 11 Apr 2025

https://github.com/thomashirtz/soft-actor-critic

Implementation of the Soft Actor Critic algorithm using Pytorch.

actor-critic openai-gym reinforcement-learning reinforcement-learning-algorithms

Last synced: 09 Apr 2025

https://github.com/djbyrne/core_rl

Repo of core reinforcement learning algorithms and explanations using pytorch lightning

pytorch pytorch-lightning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms

Last synced: 30 Jan 2025

https://github.com/ghubnerr/darwin

A thorough exploration of Reinforcement Learning through OpenAI Gymnasiums. Inspired by OpenAI's "Emergent tool use form multi-agent interaction".

deep-q-learning multi-agent-reinforcement-learning openai-gym reinforcement-learning reinforcement-learning-algorithms

Last synced: 30 Dec 2024

https://github.com/ondrejbiza/aamas_19

Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".

aamas abstraction deep-learning deep-neural-networks reinforcement-learning reinforcement-learning-algorithms

Last synced: 26 Apr 2025

https://github.com/sintefneodroid/aav

Autonomous Aerial Vehicle

aav agent drone machine-learning machine-learning-algorithms ml planning quadcopter reinforcement reinforcement-agents reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms reinforcement-learning-playground rl

Last synced: 12 Apr 2025

https://github.com/howl-anderson/q_learning_demo

Show how Q-learning works from scratch

gym-environment q-learning reinforcement-learning reinforcement-learning-algorithms

Last synced: 21 Nov 2024

https://github.com/gokulp01/bluerov2_gym

A Gymnasium environment for simulating and training reinforcement learning agents on the BlueROV2 underwater vehicle.

autonomous-robots bluerov2 gymnasium gymnasium-environment reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments robotics robotics-simulation rov underwater-robotics

Last synced: 14 Dec 2024

https://github.com/zeekersky/target_strike_game

Target Strike game is an unity based compititive game. This game is created for CS662 - Mobile VR & AI course offered at IIT Mandi. Here the source files are given.

behavioral-cloning gail game-development mlagents ppo reinforcement-learning-algorithms unity3d

Last synced: 09 Apr 2025

https://github.com/amifunny/reinforce_adventure

This Repository contains my implementation of popular algorithms on popular environments.

actor-critic bandit ddpg dqn dqn-tensorflow gym openai reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 25 Mar 2025

https://github.com/rickstaa/stable-learning-control

A framework for training theoretically stable (and robust) Reinforcement Learning control algorithms.

artificial-intelligence control deep-learning framework gaussian-networks gymnasium machine-learning neural-networks openai-gym reinforcement-learning reinforcement-learning-agents reinforcement-learning-algorithms robustness simulation stability

Last synced: 13 Feb 2025

https://github.com/gabotechs/lazaro

Reinforcement learning framework for implementing custom models on custom environments using state of the art RL algorithms

actor-critic artificial-intelligence deep-learning deep-learning-algorithms deep-q-learning deep-q-learning-network deep-q-network deep-reinforcement-learning ppo reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms

Last synced: 14 Apr 2025

https://github.com/jianzhnie/llmtech

LLMTechSite, 专注于通用人工智能领域的技术生态。

aigc diffusion-models llms reinforcement-learning-algorithms

Last synced: 08 Apr 2025

https://github.com/davestroud/q-learning

Q-Learning and Deep Q-Learning Demo

qlearning qlearning-algorithm reinforcement-learning reinforcement-learning-algorithms

Last synced: 12 Apr 2025

https://github.com/epignatelli/human-level-control-through-deep-reinforcement-learning

A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. nature, 518(7540), pp.529-533.

atari deep-reinforcement-learning deepmind dqn jax papers-with-code reinforcement-learning reinforcement-learning-algorithms stax

Last synced: 02 Mar 2025

https://github.com/openagi/deeprl

Tensorflow framework for deep reinforcement learning

deep-reinforcement-learning reinforcement-learning-algorithms tensorflow-framework

Last synced: 12 Apr 2025

https://github.com/pegah-ardehkhani/shortest-path-using-reinforcement-learning

Solve the shortest path problem using Reinforcement Learning. This project applies RL techniques, such as Q-learning and SARSA(λ), to find optimal routes in a weighted graph, where the algorithm learns to navigate by receiving rewards based on edge distances.

q-learning reinforcement-learning reinforcement-learning-algorithms sarsa sarsa-lambda shortest-path