Projects in Awesome Lists tagged with reinforcement-learning-algorithms
A curated list of projects in awesome lists tagged with reinforcement-learning-algorithms.
https://github.com/dlr-rm/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
baselines gsde gym machine-learning openai python pytorch reinforcement-learning reinforcement-learning-algorithms robotics sb3 sde stable-baselines toolbox
Last synced: 12 May 2025
https://github.com/udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
cross-entropy ddpg deep-reinforcement-learning dqn dynamic-programming hill-climbing ml-agents neural-networks openai-gym openai-gym-solutions ppo pytorch pytorch-rl reinforcement-learning reinforcement-learning-algorithms rl-algorithms
Last synced: 27 Nov 2024
https://github.com/hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox
Last synced: 26 Mar 2025
https://github.com/opendilab/di-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac
Last synced: 12 May 2025
https://github.com/nikhilbarhate99/ppo-pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms
Last synced: 15 May 2025
https://github.com/nicrusso7/rex-gym
OpenAI Gym environments for an open-source quadruped robot (SpotMicro)
artificial-intelligence gym-environment inverse-kinematics legged-robots machine-learning openai openai-gym openai-gym-environments pybullet python3 quadruped quadruped-robot quadruped-robot-gaits reinforcement-learning reinforcement-learning-algorithms robot robotic-arm robotics spotmicro tensorflow
Last synced: 08 Apr 2025
https://github.com/neuromatchacademy/course-content-dl
NMA deep learning course
continual-learning convolutional-neural-networks deep-learning recurrent-neural-networks reinforcement-learning-algorithms transformers
Last synced: 14 May 2025
https://github.com/JuliaPOMDP/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
artificial-intelligence control-systems julia markov-decision-processes mdps pomdps python reinforcement-learning reinforcement-learning-algorithms
Last synced: 09 May 2025
https://github.com/luchris429/purejaxrl
Really Fast End-to-End JAX RL Implementations
deep-reinforcement-learning jax ppo reinforcement-learning reinforcement-learning-algorithms
Last synced: 20 Mar 2025
https://github.com/cpnota/autonomous-learning-library
A PyTorch library for building deep reinforcement learning agents.
a2c advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-learning deep-reinforcement-learning dqn dqn-pytorch ppo proximal-policy-optimization reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic
Last synced: 01 Apr 2025
https://github.com/skylark0924/rofunc
🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
embodied-ai forward-kinematics humanoid humanoid-robots imitation-learning inverse-kinematics isaac-gym isaac-sim learning-from-demonstration manipulability optitrack planning-algorithms reinforcement-learning-algorithms robot robot-control robot-learning robot-manipulation robot-planning
Last synced: 14 May 2025
https://github.com/cbfinn/gps
Guided Policy Search
deep-learning deep-reinforcement-learning reinforcement-learning reinforcement-learning-algorithms robotics
Last synced: 04 Apr 2025
https://github.com/stable-baselines-team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
experimental gsde gym machine-learning openai pytorch reinforcement-learning reinforcement-learning-algorithms research rl robotics sde stable-baselines
Last synced: 14 May 2025
https://github.com/mishalaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac
Last synced: 05 Apr 2025
https://github.com/Omegastick/pytorch-cpp-rl
PyTorch C++ Reinforcement Learning
a2c actor-critic advantage-actor-critic continuous-control cplusplus cpp libtorch ppo proximal-policy-optimization pytorch pytorch-cpp-frontend pytorch-rl reinforcement-learning reinforcement-learning-algorithms
Last synced: 07 May 2025
https://github.com/qiwihui/reinforcement-learning-an-introduction-chinese
Chinese translation of "Reinforcement Learning: An Introduction" (second edition)
reinforcement-learning reinforcement-learning-algorithms sphinx-doc
Last synced: 05 Apr 2025
https://github.com/EricSteinberger/PokerRL
Framework for Multi-Agent Deep Reinforcement Learning in Poker
deep-learning framework gym-environment poker ray reinforcement-learning reinforcement-learning-algorithms research
Last synced: 27 Mar 2025
https://github.com/sforaidl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 04 Apr 2025
https://github.com/pku-alignment/safe-policy-optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
benchmarks constrained-reinforcement-learning reinforcement-learning-algorithms safe safe-reinforcement-learning
Last synced: 07 May 2025
https://github.com/huawei-noah/xingtian
XingTian is a componentized library for the development and verification of reinforcement learning algorithms
dqn impala muzero ppo qmix reinforcement-learning-algorithms
Last synced: 05 Apr 2025
https://github.com/nikhilbarhate99/hierarchical-actor-critic-hac-pytorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI Gym environments
actor-critic deep-reinforcement-learning gym-environment gym-environments hierarchical-reinforcement-learning openai-gym pytorch pytorch-implementation pytorch-rl reinforcement-learning reinforcement-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/opendilab/di-engine-docs
DI-engine docs (Chinese and English)
deep-learning imitation-learning inverse-reinforcement-learning model-based-reinforcement-learning multi-agent-reinforcement-learning offline-rl pytorch-rl reinforcement-learning reinforcement-learning-algorithms
Last synced: 06 Apr 2025
https://github.com/jxareas/machine-learning-notebooks
The full collection of Jupyter Notebook labs from Andrew Ng's Machine Learning Specialization.
clustering deep-learning jupyter-notebook kmeans learn linear-regression logistic-regression machine-learning machine-learning-algorithms neural-network numpy python regression reinforcement-learning reinforcement-learning-algorithms supervised-learning tensorflow unsupervised-learning
Last synced: 16 May 2025
https://github.com/imagry/aleph_star
Reinforcement learning with A* and a deep heuristic
control-systems dqn machine-learning-algorithms optimization-algorithms reinforcement-learning reinforcement-learning-algorithms shortest-path
Last synced: 27 Nov 2024
https://github.com/aurimas13/machine-learning-goodness
A machine learning project collecting ML/DL projects, notebooks, ML/DL cheat sheets, useful information on AI/AGI, and code snippets, scripts, and tasks with tips.
algorithms artifcial-intelligence artificial-intelligence chatgpt cheatsheets computer-science data-science deep-neural-networks deep-reinforcement-learning gpt4 machine-learning machine-learning-algorithms mlops python python3 reinforcement-learning reinforcement-learning-algorithms tips tips-and-tricks
Last synced: 21 Apr 2025
https://github.com/BY571/Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL + D2RL and parallel Environments.
actor-critic-algorithm continuous d2rl emphasizing-recent-experience multi-environment munchausen munchausen-reinforcement-learning parallel-computing prioritized-experience-replay pytorch reinforcement-learning reinforcement-learning-algorithms sac soft-actor-critic
Last synced: 27 Nov 2024
https://github.com/bentrevett/pytorch-rl
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
a2c actor-critic advantage-actor-critic generalized-advantage-estimation policy-gradient pytorch pytorch-implementation pytorch-implmention pytorch-rl pytorch-tutorial pytorch-tutorials reinforcement-learning reinforcement-learning-algorithms rl
Last synced: 27 Mar 2025
https://github.com/binary-husky/hmp2g
Multiagent Reinforcement Learning Research Project
machine-learning reinforcement-learning-algorithms simulation
Last synced: 04 Apr 2025
https://github.com/nasdin/reinforcementlearning-atarigame
PyTorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. Also uses Google DeepMind's Asynchronous Advantage Actor-Critic (A3C) algorithm, which the author presents as superior to and more efficient than DQN. Can play many games.
a3c a3c-lstm actor-critic adam asynchronous-advantage-actor-critic deep-reinforcement-learning lstm openai-gym python pytorch reinforcement-agents reinforcement-learning reinforcement-learning-algorithms rmsprop universe
Last synced: 10 Apr 2025
https://github.com/coax-dev/coax
Modular framework for Reinforcement Learning in python
reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms
Last synced: 13 May 2025
https://github.com/kkuette/TradzQAI
Trading environment for RL agents, backtesting and training.
algorithm backtesting bitcoin bitcoin-bot reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms trading trading-algorithms trading-bot trading-env
Last synced: 24 Mar 2025
https://github.com/gordicaleksa/pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.
deep-learning deep-q-network dqn jupyter policy-gradient ppo python pytorch pytorch-dqn pytorch-implementation pytorch-policy-gradient pytorch-ppo reinforcement-learning reinforcement-learning-algorithms rl
Last synced: 26 Apr 2025
https://github.com/xiangwang1223/kgpolicy
Reinforced Negative Sampling over Knowledge Graph for Recommendation, WWW2020
explainable-recommendation knowledge-aware-recommendation knowledge-based-recommendation knowledge-graph knowledge-graph-dataset knowledge-graph-for-recommendation negative-sampling recommender-system reinforcement-learning reinforcement-learning-algorithms www2020
Last synced: 20 Dec 2024
https://github.com/BY571/DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN
atari c51 ddqn-pyotrch deep-reinforcement-learning dqn-pytorch drqn dueling-dqn-pytorch multi-environment multi-step-dqn multiprocessing n-step-dqn noisy-dqn openai parallel-computing prioritized-experience-replay rainbow reinforcement-learning-agent reinforcement-learning-algorithms
Last synced: 05 May 2025
https://github.com/activatedgeek/torchrl
Highly Modular and Scalable Reinforcement Learning
deep-learning deep-reinforcement-learning dqn machine-learning policy-gradient python3 pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 17 Mar 2025
https://github.com/XinJingHao/TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
reinforcement-learning-algorithms robot
Last synced: 28 Nov 2024
https://github.com/yaricom/goneat
A Go implementation of the NeuroEvolution of Augmenting Topologies (NEAT) method to evolve and train artificial neural networks without error backpropagation
artificial-neural-networks augmenting-topologies neat neural-network neuroevolution reinforcement-learning reinforcement-learning-algorithms unsupervised-learning unsupervised-machine-learning
Last synced: 06 Apr 2025
https://github.com/namoshizun/pypomdp
Python implementation of POMDP framework and PBVI & POMCP algorithms.
educational pomdp python reinforcement-learning-algorithms
Last synced: 21 Mar 2025
https://github.com/nikhilbarhate99/actor-critic-pytorch
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
a2c actor-critic deep-reinforcement-learning openai-gym openai-gym-environments policy-gradient pytorch pytorch-implmention pytorch-tutorial reinforcement-learning-algorithms
Last synced: 04 May 2025
https://github.com/diditforlulz273/PokerRL-Omaha
Omaha Poker functionality plus some features for the PokerRL reinforcement learning card framework
cfr counterfactual-regret-minimization deep-learning monte-carlo-tree-search omaha-poker poker-bot pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 27 Mar 2025
https://github.com/ugurkanates/spacexreinforcementlearning
SpaceX Falcon 9 simulated with reinforcement learning algorithms such as D4PG, SAC, and PPO.
reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments rocket spacex
Last synced: 08 Feb 2025
https://github.com/ikostrikov/pytorch-rl
pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Apr 2025
https://github.com/yaricom/goneat_ns
This project provides a Go implementation of NeuroEvolution of Augmenting Topologies (NEAT) with Novelty Search optimization, aimed at solving deceptive tasks with strong local optima
artificial-neural-networks augmenting-topologies explainable-ai explainable-artificial-intelligence golang modular-ai neat neuroevolution novelty-search reinforcement-learning-algorithms unsupervised-learning unsupervised-learning-algorithms unsupervised-machine-learning
Last synced: 05 Apr 2025
https://github.com/xuehaipan/mate
MATE: the Multi-Agent Tracking Environment.
multi-agent-reinforcement-learning openai-gym openai-gym-environment reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environment
Last synced: 01 May 2025
https://github.com/mazzzystar/qlearningmouse
Cat-and-Mouse game with Reinforcement Learning (Q-Learning).
qlearning-algorithm reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-excercises
Last synced: 28 Apr 2025
https://github.com/diovisgood/intraday
Gym environment which simulates intraday trading
candle candlestick candlestick-chart environment gym gym-environment intraday reinforcement-learning reinforcement-learning-algorithms stream trades trading trading-bot trading-strategies
Last synced: 10 Apr 2025
https://github.com/baskuit/r-nad
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
deepnash multiagent-reinforcement-learning pytorch reinforcement-learning reinforcement-learning-algorithms rnad
Last synced: 30 Jan 2025
https://github.com/mimoralea/king-pong
Deep Reinforcement Learning Pong Agent, King Pong, he's the best
agent deep-learning deep-q-network deep-reinforcement-learning dqn king-pong machine-learning percept q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 15 Apr 2025
https://github.com/jason-cky/deeprl-pytorch
PyTorch implementations of various deep reinforcement learning algorithms on PyBullet environments.
ddpg ppo pybullet-environments python3 pytorch-implementation reinforcement-learning-algorithms rlbench td3 trpo
Last synced: 20 Nov 2024
https://github.com/v-i-s-h/mab.jl
A Julia package for multi-armed bandit experiments
bandit-experiments exp julia julia-language julia-package julialang mab multi-arm-bandits reinforcement-learning reinforcement-learning-algorithms thompson-sampling ucb
Last synced: 01 May 2025
https://github.com/sadrasabouri/pyrandwalk
Python Library for Random Walks
education educational markov-chain networkx probabilistic-graphical-models probability python random-walk reinforcement-learning reinforcement-learning-algorithms simulation stochastic-processes
Last synced: 16 Mar 2025
https://github.com/seungjaeryanlee/rl-exploration
Reinforcement Learning papers on exploration methods.
exploration papers reinforcement-learning reinforcement-learning-algorithms research
Last synced: 25 Jan 2025
https://github.com/gibbsbravo/paraphrasee
Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis
deep-learning deep-reinforcement-learning natural-language-generation natural-language-processing paraphrase-detection paraphrase-generation paraphrase-identification reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms reinforcement-learning-environments
Last synced: 12 Apr 2025
https://github.com/ondrejbiza/bandits
Comparison of bandit algorithms from the Reinforcement Learning bible.
machine-learning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sutton-book
Last synced: 26 Apr 2025
https://github.com/ondrejbiza/racetrack
An environment for tabular Reinforcement Learning agents.
machine-learning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sutton-book
Last synced: 26 Apr 2025
https://github.com/epignatelli/discovering-reinforcement-learning-algorithms
A JAX/Stax implementation of the general meta-learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. and Silver, D., 2020. Discovering reinforcement learning algorithms. Advances in Neural Information Processing Systems, 33.
actor-critic general-meta-learning jax lstm meta-learning paper-implementations paper-with-code policy-gradient reinforcement-learning reinforcement-learning-algorithms rnn stax
Last synced: 02 Mar 2025
https://github.com/xuehaipan/soft-actor-critic
PyTorch Implementation of Soft Actor-Critic Algorithm
actor-critic actor-critic-algorithm reinforcement-learning-algorithms soft-actor-critic
Last synced: 07 May 2025
https://github.com/ajaysub110/rlin200lines
PyTorch implementations of reinforcement learning algorithms in fewer than 200 lines
deep-reinforcement-learning dqn machine-learning policy-gradient ppo pytorch-implementations reinforcement-learning reinforcement-learning-algorithms soft-actor-critic
Last synced: 05 Dec 2024
https://github.com/pockerman/qubic_engine
Collection of C++ based algorithms on numerics, statistics, control, reinforcement learning, machine learning and robotics
control cpp cpp17 extended-kalman-filter filtering finite-element-method finite-volume-method gradient-descent kalman-filter machine-learning numerics physics-simulation reinforcement-learning-algorithms robotics statistics
Last synced: 12 Apr 2025
https://github.com/epignatelli/helx
Interoperating between (Deep) Reinforcement Learning libraries
deep-learning flax jax reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments rl rl-environments
Last synced: 02 Mar 2025
https://github.com/eantcal/nunn
Collection of Machine Learning Algorithms
cplusplus-17 deep-neural-networks handwritten-digits linux machine-learning machine-learning-algorithms macos mnist modern-cpp multilayer-perceptron multilayer-perceptron-network neural-network ocr-test qlearning qlearning-algorithm reinforcement-learning-algorithms sarsa tictactoe windows xor-problem
Last synced: 20 Nov 2024
https://github.com/baggepinnen/deterministicpolicygradient.jl
Reinforcement learning with Deterministic Policy Gradient methods
algorithm reinforcement-learning reinforcement-learning-algorithms
Last synced: 15 Mar 2025
https://github.com/zrr1999/reinforcement-learning-with-pytorch
PyTorch implementation of the Morvan (莫烦) reinforcement learning tutorials
deep-learning pytorch reinforcement-learning reinforcement-learning-algorithms rl
Last synced: 11 Apr 2025
https://github.com/cyrildever/reinforcement-learning-in-golang
Code for the algorithms of the "Reinforcement Learning" book
golang machine-learning reinforcement-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/wjaskowski/mastering-2048
An efficient reinforcement learning algorithm for learning a strategy for game 2048
2048 ntuples reinforcement-learning reinforcement-learning-algorithms td-learning
Last synced: 12 Apr 2025
https://github.com/epignatelli/reinforcement-learning-an-introduction
A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.
dynamic-programming papers-with-code reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-excercises reinforcement-learning-tutorials sutton-barto-book
Last synced: 02 Mar 2025
https://github.com/kyegomez/hindsightreplay
My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"
artificial-intelligence machine-learning reinforcement-learning-algorithms reinfrocement-learning
Last synced: 07 May 2025
https://github.com/chudleyj/rl.cpp
Reinforcement Learning for stocks in C++
c-plus-plus cpp cpp11 machine-learning machine-learning-algorithms reinforcement-learning reinforcement-learning-algorithms stock-data stock-price-prediction stock-prices
Last synced: 11 Apr 2025
https://github.com/thomashirtz/soft-actor-critic
Implementation of the Soft Actor-Critic algorithm using PyTorch.
actor-critic openai-gym reinforcement-learning reinforcement-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/djbyrne/core_rl
Repo of core reinforcement learning algorithms and explanations using PyTorch Lightning
pytorch pytorch-lightning reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms
Last synced: 30 Jan 2025
https://github.com/ghubnerr/darwin
A thorough exploration of Reinforcement Learning through OpenAI Gymnasiums. Inspired by OpenAI's "Emergent tool use from multi-agent interaction".
deep-q-learning multi-agent-reinforcement-learning openai-gym reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Dec 2024
https://github.com/ondrejbiza/aamas_19
Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".
aamas abstraction deep-learning deep-neural-networks reinforcement-learning reinforcement-learning-algorithms
Last synced: 26 Apr 2025
https://github.com/sintefneodroid/aav
Autonomous Aerial Vehicle
aav agent drone machine-learning machine-learning-algorithms ml planning quadcopter reinforcement reinforcement-agents reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms reinforcement-learning-playground rl
Last synced: 12 Apr 2025
https://github.com/howl-anderson/q_learning_demo
Show how Q-learning works from scratch
gym-environment q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 21 Nov 2024
https://github.com/gokulp01/bluerov2_gym
A Gymnasium environment for simulating and training reinforcement learning agents on the BlueROV2 underwater vehicle.
autonomous-robots bluerov2 gymnasium gymnasium-environment reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments robotics robotics-simulation rov underwater-robotics
Last synced: 14 Dec 2024
https://github.com/zeekersky/target_strike_game
Target Strike is a Unity-based competitive game created for the CS662 - Mobile VR & AI course offered at IIT Mandi. The source files are provided here.
behavioral-cloning gail game-development mlagents ppo reinforcement-learning-algorithms unity3d
Last synced: 09 Apr 2025
https://github.com/amifunny/reinforce_adventure
This repository contains my implementations of popular algorithms on popular environments.
actor-critic bandit ddpg dqn dqn-tensorflow gym openai reinforcement-learning reinforcement-learning-algorithms rl
Last synced: 25 Mar 2025
https://github.com/rickstaa/stable-learning-control
A framework for training theoretically stable (and robust) Reinforcement Learning control algorithms.
artificial-intelligence control deep-learning framework gaussian-networks gymnasium machine-learning neural-networks openai-gym reinforcement-learning reinforcement-learning-agents reinforcement-learning-algorithms robustness simulation stability
Last synced: 13 Feb 2025
https://github.com/gabotechs/lazaro
Reinforcement learning framework for implementing custom models on custom environments using state of the art RL algorithms
actor-critic artificial-intelligence deep-learning deep-learning-algorithms deep-q-learning deep-q-learning-network deep-q-network deep-reinforcement-learning ppo reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms
Last synced: 14 Apr 2025
https://github.com/jianzhnie/llmtech
LLMTechSite, focused on the technology ecosystem of artificial general intelligence.
aigc diffusion-models llms reinforcement-learning-algorithms
Last synced: 08 Apr 2025
https://github.com/davestroud/q-learning
Q-Learning and Deep Q-Learning Demo
qlearning qlearning-algorithm reinforcement-learning reinforcement-learning-algorithms
Last synced: 12 Apr 2025
https://github.com/epignatelli/human-level-control-through-deep-reinforcement-learning
A JAX/Stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540), pp.529-533.
atari deep-reinforcement-learning deepmind dqn jax papers-with-code reinforcement-learning reinforcement-learning-algorithms stax
Last synced: 02 Mar 2025
https://github.com/openagi/deeprl
Tensorflow framework for deep reinforcement learning
deep-reinforcement-learning reinforcement-learning-algorithms tensorflow-framework
Last synced: 12 Apr 2025
https://github.com/pegah-ardehkhani/shortest-path-using-reinforcement-learning
Solve the shortest path problem using Reinforcement Learning. This project applies RL techniques, such as Q-learning and SARSA(λ), to find optimal routes in a weighted graph, where the algorithm learns to navigate by receiving rewards based on edge distances.
q-learning reinforcement-learning reinforcement-learning-algorithms sarsa sarsa-lambda shortest-path
Last synced: 01 Apr 2025
https://github.com/sapanz/udacity-deep-reinforcement-learning-solution
This repo will cover most machine learning algorithms with coding examples.
artificial-intelligence artificial-intelligence-algorithms deep-learning machine-learning machine-learning-algorithms machinelearning machinelearning-python q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 12 Apr 2025
https://github.com/mardavsj/machine-learning-algorithms
Covers the basic concepts of machine learning along with its algorithms.
algorithms machine-learning python reinforcement-learning-algorithms supervised-learning-algorithms unsupervised-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/hectorpulido/easiest-deep-rl-algorithm-with-pytorch
Easiest way to understand reinforcement learning algorithms using PyTorch
ai machine-learning pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 16 Apr 2025
https://github.com/andri27-ts/classiccartpole
CartPole using Policy Gradient Model Based
cartpole machine-learning neural-network policy-gradient python reinforcement-learning reinforcement-learning-algorithms
Last synced: 15 Mar 2025
https://github.com/pockerman/cuberl
Library for reinforcement learning with C++
cpp pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 12 Apr 2025
https://github.com/edoardopona/hex-ai-reinforcement-learning
Reinforcement Learning agents for the game of Hex
deep-learning hex policy-gradient reinforce reinforcement-learning reinforcement-learning-algorithms
Last synced: 10 Jun 2025