Projects in Awesome Lists tagged with q-learning
A curated list of projects in awesome lists tagged with q-learning .
https://github.com/datawhalechina/easy-rl
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
a3c ddpg deep-reinforcement-learning double-dqn dqn dueling-dqn easy-rl imitation-learning policy-gradient ppo q-learning reinforcement-learning sarsa td3
Last synced: 10 May 2025
https://github.com/morvanzhou/reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
a3c actor-critic asynchronous-advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network double-dqn dqn dueling-dqn machine-learning policy-gradient ppo prioritized-replay proximal-policy-optimization q-learning reinforcement-learning sarsa sarsa-lambda tensorflow-tutorials tutorial
Last synced: 13 May 2025
https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
a3c actor-critic asynchronous-advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network double-dqn dqn dueling-dqn machine-learning policy-gradient ppo prioritized-replay proximal-policy-optimization q-learning reinforcement-learning sarsa sarsa-lambda tensorflow-tutorials tutorial
Last synced: 30 Mar 2025
https://github.com/aminhp/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments
Last synced: 15 May 2025
https://github.com/AminHP/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments
Last synced: 05 May 2025
https://github.com/XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
asl c51 categorical-dqn ddpg deep-reinforcement-learning double-dqn dueling-dqn machine-learning noisynet-dqn ppo prioritized-experience-replay pytorch q-learning reinforcement-learning sac td3
Last synced: 04 Mar 2025
https://github.com/vmayoral/basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
ai artificial-intelligence deep-learning deeplearning neural-networks openai-gym q-learning reinforcement-learning tutorial
Last synced: 16 May 2025
https://github.com/pskrunner14/trading-bot
Stock Trading Bot using Deep Q-Learning
ai-agents deep-q-learning q-learning reinforcement-learning stock-price-prediction stock-trading stock-trading-bot trading-algorithms trading-bot
Last synced: 22 Mar 2025
https://github.com/sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
asynchronous-advantage-actor-critic deep-deterministic-policy-gradient deep-learning-algorithms deep-q-network deep-recurrent-q-network deep-reinforcement-learning double-dqn drqn dueling-dqn hindsight-experience-replay markov-decision-processes monte-carlo openai-gym policy-gradient policy-gradients ppo q-learning reinforcement-learning sarsa trpo
Last synced: 02 Apr 2025
https://github.com/sudharsan13296/hands-on-reinforcement-learning-with-python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
asynchronous-advantage-actor-critic deep-deterministic-policy-gradient deep-learning-algorithms deep-q-network deep-recurrent-q-network deep-reinforcement-learning double-dqn drqn dueling-dqn hindsight-experience-replay markov-decision-processes monte-carlo openai-gym policy-gradient policy-gradients ppo q-learning reinforcement-learning sarsa trpo
Last synced: 04 Apr 2025
https://github.com/omerbsezer/Reinforcement_learning_tutorial_with_demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
a3c actor-critic deep-reinforcement-learning dyna dynamic-programming imitation-learning machine-learning meta-learning policy-gradient pomdps q-learning reinforcement-learning sarsa td-learning tutorial
Last synced: 19 Jul 2025
https://github.com/omerbsezer/reinforcement_learning_tutorial_with_demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
a3c actor-critic deep-reinforcement-learning dyna dynamic-programming imitation-learning machine-learning meta-learning policy-gradient pomdps q-learning reinforcement-learning sarsa td-learning tutorial
Last synced: 07 Oct 2025
https://github.com/mpatacchiola/dissecting-reinforcement-learning
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
actor-critic deep-reinforcement-learning dissecting-reinforcement-learning drone-landing genetic-algorithm inverted-pendulum markov-chain mountain-car multi-armed-bandit neural-networks q-learning reinforcement-learning sarsa temporal-differencing-learning
Last synced: 04 Apr 2025
https://github.com/glample/Arnold
Arnold - DOOM Agent
artificial-intelligence doom doom-2 neural-network q-learning reinforcement-learning self-playing vizdoom-competition
Last synced: 07 May 2025
https://github.com/VinF/deer
DEEp Reinforcement learning framework
deep-reinforcement-learning policy-gradient q-learning
Last synced: 19 Jul 2025
https://github.com/filangelos/qtrader
Reinforcement Learning for Portfolio Management
algorithmic-trading python q-learning recurrent-neural-networks reinforcement-learning
Last synced: 24 Mar 2025
https://github.com/sudharsan13296/deep-reinforcement-learning-with-python
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
a2c a3c actor-critic bellman-equation c51 ddpg deep-learning deep-reinforcement-learning double-dqn dqn inverse-reinforcement-learning openai-gym policy-gradient ppo q-learning reinforcement-learning sac td3 trpo
Last synced: 05 Apr 2025
https://github.com/lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
artificial-intelligence attention-mechanisms deep-learning offline-learning q-learning robotics transformers
Last synced: 15 May 2025
https://github.com/vdutor/tf-rex
Play Google Chrome's T-rex game with TensorFlow
deep-reinforcement-learning q-learning t-rex tensorflow
Last synced: 18 Aug 2025
https://github.com/paperspace/dinoruntutorial
Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"
jupyter-notebook keras-tensorflow machine-learning paperspace q-learning reinforcement-learning
Last synced: 06 Apr 2025
https://github.com/Paperspace/DinoRunTutorial
Accompanying code for Paperspace tutorial "Build an AI to play Dino Run"
jupyter-notebook keras-tensorflow machine-learning paperspace q-learning reinforcement-learning
Last synced: 19 Jul 2025
https://github.com/accel-brain/accel-brain-code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
auto-encoder automatic-summarization combinatorial-optimization deep-learning deep-q-network deep-reinforcement-learning energy-based-model generative-adversarial-network lstm multi-agent-reinforcement-learning q-learning quantum-annealing quantum-monte-carlo reinforcement-learning restricted-boltzmann-machine self-supervised-learning semi-supervised-learning simulated-annealing transfer-learning
Last synced: 30 Oct 2025
https://github.com/nuno-faria/tetris-ai
A deep reinforcement learning bot that plays tetris
deep-reinforcement-learning game-ai q-learning tetris
Last synced: 06 Apr 2025
https://github.com/cyoon1729/deep-Q-networks
Implementations of algorithms from the Q-learning family. Implementations inlcude: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
deep-q-network pytorch q-learning reinforcement-learning
Last synced: 19 Jul 2025
https://github.com/cool-rr/marley
A framework for multi-agent reinforcement learning.
ai hacktoberfest keras machine-learning python q-learning reinforcement-learning tensorflow
Last synced: 05 Apr 2025
https://github.com/cool-RR/marley
A framework for multi-agent reinforcement learning.
ai hacktoberfest keras machine-learning python q-learning reinforcement-learning tensorflow
Last synced: 01 May 2025
https://github.com/allenpandas/reinforcement-learning-papers
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
aaai artificial-intelligence deep-reinforcement-learning dqn hierarchical-reinforcement-learning icml ijcai imitation-learning meta-reinforcement-learning multi-agent-reinforcement-learning neurips offline-reinforcement-learning policy-gradient q-learning reinforcement-learning reinforcement-learning-conferences reinforcement-learning-conferences-papers reinforcement-learning-paper reinforcement-learning-papers rl-papers
Last synced: 17 Mar 2025
https://github.com/sea-snell/implicit-language-q-learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
implicit-q-learning iql language-model nlp offline-rl python pytorch q-learning reinforcement-learning
Last synced: 08 May 2025
https://github.com/navjindervirdee/2048-deep-reinforcement-learning
Trained A Convolutional Neural Network To Play 2048 using Deep-Reinforcement Learning
2048-game convolutional-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning neural-network q-learning reinforcement-learning
Last synced: 25 Jul 2025
https://github.com/jczic/micromlp
A micro neural network multilayer perceptron for MicroPython (used on ESP32 and Pycom modules)
ai ann artificial-intelligence deep-learning deeplearning esp32 hc2 lopy machine-learning micropython mlp multilayer-perceptron neural-network neurons predictive-modeling pycom q-learning qlearning wipy
Last synced: 24 Jun 2025
https://github.com/jczic/MicroMLP
A micro neural network multilayer perceptron for MicroPython (used on ESP32 and Pycom modules)
ai ann artificial-intelligence deep-learning deeplearning esp32 hc2 lopy machine-learning micropython mlp multilayer-perceptron neural-network neurons predictive-modeling pycom q-learning qlearning wipy
Last synced: 12 Apr 2025
https://github.com/lucidrains/llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
artificial-intelligence attention deep-learning q-learning
Last synced: 05 Apr 2025
https://github.com/unity-technologies/q-gridworld
Demo project using tabular Q-learning algorithm
q-learning reinforcement-learning unity unity3d
Last synced: 19 Oct 2025
https://github.com/chen0040/java-reinforcement-learning
Package provides java implementation of reinforcement learning algorithms such Q-Learn, R-Learn, SARSA, Actor-Critic
actor-critic java q-learning reinforcement-learning sarsa sarsa-lambda
Last synced: 15 Aug 2025
https://github.com/juliamusic/pianofingering.jl
(Experimental, a lot of bugs) Automatic fingering generator for piano scores, determining optimal fingering using Model-Based Reinforcement Learning, written in the Julia language.
dyna julia julia-language julia-package mdp model-based-reinforcement-learning music21 piano piano-fingering piano-fingers piano-keyboard q-learning reinforcement-learning
Last synced: 31 Oct 2025
https://github.com/agentmaker/paddle-rlbooks
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
actor-critic c51 ddpg double-dqn dqn dueling-dqn noisy-dqn nstep-dqn paddlepaddle policy-gradient policy-gradient-with-baseline policy-iteration q-learning reinforce reinforcement-learning sac sarsa td3 value-iteration
Last synced: 02 Jul 2025
https://github.com/ravi72munde/chrome-dino-reinforcement-learning
An RL implementation in Keras
chrome-dino-game convolutional-neural-networks q-learning reinforcement-learning
Last synced: 15 Aug 2025
https://github.com/urinx/reinforcementlearning
Reinforcing Your Learning of Reinforcement Learning
advantage-actor-critic alphago alphago-zero atari-2600 cartpole ddpg doom dqn frozenlake gomoku mcts policy-gradient ppo q-learning reinforcement-learning space-invaders tic-tac-toe
Last synced: 14 Jul 2025
https://github.com/lufficc/dqn
Implementation of q-learning using TensorFlow
machine-learning q-learning reinforcement-learning
Last synced: 06 May 2025
https://github.com/gsurma/deep_traffic
MIT DeepTraffic top 2% solution (75.01 mph) 🚗.
artificial-intelligence deep-traffic dqn hyperparameter-optimization javascript machine-learning q-learning reinforcement-learning traffic
Last synced: 12 May 2025
https://github.com/dmitrijsc/practical-rl
Reinforcement learning in Julia. Solving OpenAI gym.
cross-entropy frozenlake julia julialang mxnet openai openai-gym q-learning reinforcement-learning
Last synced: 10 Apr 2025
https://github.com/rfeinman/tictactoe-reinforcement-learning
Train a tic-tac-toe agent using reinforcement learning.
q-learning reinforcement-learning rl-agents sarsa-learning tic-tac-toe
Last synced: 20 Mar 2025
https://github.com/linesd/tabular-methods
Tabular methods for reinforcement learning
algorithm cliffwalking gridworld gridworld-cliff gridworld-environment policy-evaluation policy-iteration q-learning q-learning-algorithm q-learning-vs-sarsa reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sarsa sarsa-algorithm sarsa-learning tabular-environments tabular-methods tabular-q-learning value-iteration
Last synced: 08 Oct 2025
https://github.com/daugaard/q-learning-simple-game
ai ml q-learning reinforcement-learning ruby rubyml
Last synced: 03 Apr 2026
https://github.com/neymarl/pacman-rl
Implement some reinforcement learning algorithms, test and visualize on Pacman.
actor-critic pacman policy policy-gradient q-learning reinforcement-learning sarsa-lambda
Last synced: 11 Jul 2025
https://github.com/vivek3141/pacman-ai
A.I. plays the original 1980 Pacman using Neuroevolution of Augmenting Topologies and Deep Q Learning
artificial-intelligence deep-q-learning dopamine dqn neat neat-python neural-network neuroevolution pacman python q-learning reinforcement-learning tensorflow tensorflow-rl
Last synced: 01 Apr 2025
https://github.com/stratismarkou/sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
bayesian-inference bayesian-methods bayesian-q-learning efficient-rl exploration exploration-exploitation moment-matching posterior-sampling psrl q-learning reinforcement-learning reproducible-research tabular-environments tabular-methods uncertainty-bellman-equation
Last synced: 14 Apr 2025
https://github.com/makaveli10/reinforcementlearning
Reinforcement Learning - Implementation of Exercises, algorithms from the book Sutton Barto and David silver's RL course in Python, OpenAI Gym.
david-silver-course double-dqn dqn dqn-pytorch dynamic-programming experience-replay machine-learning model-free-rl monte-carlo-methods n-step-bootstrapping n-step-sarsa prioritized-experience-replay q-learning q-learning-vs-sarsa random-walk reinforcement-learning sarsa sutton-gambler sutton-gridworld td-learning
Last synced: 18 Mar 2025
https://github.com/mimoralea/king-pong
Deep Reinforcement Learning Pong Agent, King Pong, he's the best
agent deep-learning deep-q-network deep-reinforcement-learning dqn king-pong machine-learning percept q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 25 Aug 2025
https://github.com/satwikkansal/q-learning-taxi-v3
Simple q-learning implementation for taxi-v3 environment of Open AI gym.
gym python q-learning reinforcement-learning taxi-v2 taxi-v3
Last synced: 11 Apr 2025
https://github.com/jacobkahn/reinforcement-learning-market-microstructure
Using Q-learning to better navigate orderbooks.
deep-learning financial-markets machine-learning neural-networks orderbook q-learning reinforcement-learning
Last synced: 13 Apr 2025
https://github.com/tirthajyoti/rl_basics
Basic Reinforcement Learning algorithms
artificial-intelligence machine-learning machine-learning-algorithms policy-iteration q-learning reinforcement-learning td-learning temporal-differencing-learning value-iteration
Last synced: 06 May 2025
https://github.com/vivek3141/rubiks-cube-ai
Using Deep Reinforcement Learning, a computer program learns how to solve the Rubik's Cube, the world's most popular toy.
machine-learning q-learning reinforcement-learning rubiks-cube
Last synced: 14 Mar 2025
https://github.com/fszewczyk/rocket-landing-rl
Custom OpenAI Gym for vertical rocket landing and Deep Q-Learning implementation.
deep-q-learning deep-q-network q-learning reinforcement-learning rocket-landing thrust-vector-control
Last synced: 14 Jan 2026
https://github.com/anroshka/snake-ai
🐍 A Snake game AI that learns to play through Deep Q-Learning. Built with PyTorch and Pygame, featuring CUDA acceleration and real-time visualization of the learning process.
artificial-intelligence collaborate collaboration cuda deep-learning deep-q-learning dqn game-ai gpu-acceleration machine-learning neural-network pygame python pytorch q-learning reinforcement-learning snake-game
Last synced: 24 Feb 2026
https://github.com/mtrazzi/gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
demo game gym-environment numpy python3 q-learning reinforcement-learning simulation zelda
Last synced: 12 Apr 2025
https://github.com/mehdishahbazi/dqn-frozenlake-gymnasium
This repo implements Deep Q-Network (DQN) for solving the Frozenlake-v1 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 in both 4x4 and 8x8 map sizes.
deep-learning deep-q-learning deep-q-network deep-reinforcement-learning dqn drl drl-pytorch frozenlake gym gymnasium python pytorch q-learning reinforcement-learning
Last synced: 12 Apr 2025
https://github.com/entrpn/fingym
A tool for developing reinforcement learning algorithms focused in stock prediction
artificial-intelligence artificial-intelligence-algorithms evolution-strategies evolutionary-algorithms evolutionary-strategy python q-learning reinforcement-agents reinforcement-learning reinforcement-learning-environments reinforcement-learning-playground stock-market stock-price-prediction
Last synced: 12 Apr 2025
https://github.com/chen0040/java-reinforcement-learning-flappy-bird
Demo of java-reinforcement-learning library using flappy bird
actor-critic flappy-bird q-learning r-learning reinforcement-learning sarsa
Last synced: 15 Aug 2025
https://github.com/timkoornstra/automatic-piano-fingering
This repository contains the code for automatically generating piano fingerings using a reinforcement learning agent that uses Q-Learning.
fingering piano piano-fingering python q-learning q-learning-algorithm reinforcement-learning
Last synced: 13 Jun 2025
https://github.com/alvinwan/bots-for-atari-games
Bots for Atari Games using Reinforcement Learning
atari-games deep-learning deep-reinforcement-learning frozenlake q-learning reinforcement-learning spaceinvaders
Last synced: 27 Oct 2025
https://github.com/sayakpaul/floydhub-q-learning-blog
Contains the Jupyter Notebook made for a FloydHub article on Q-Learning
pytorch q-learning reinforcement-learning
Last synced: 08 Mar 2026
https://github.com/masouduut94/MCTS-agent-cythonized
MONTE Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.
cython last-good-reply markov-decision-processes monte-carlo-simulation monte-carlo-tree-search performance-optimization poolrave q-learning rapid-action-value-estimation reinforcement-learning tkinter
Last synced: 11 Sep 2025
https://github.com/neelsomani/literature
Literature card game implementation: https://en.wikipedia.org/wiki/Literature_(card_game)
artificial-intelligence card-game machine-learning neural-network q-learning
Last synced: 09 Apr 2025
https://github.com/jczic/micronn
Micro neural network with multi-dimensional layers, multi-shaped data, fully or locally meshing, conv2D, unconv2D, Qlearning, ... for test!
artificial-intelligence autoencoder-neural-network convolutional-autoencoder convolutional-neural-networks deep-learning jczic machine-learning mlp-classifier neural-network neural-networks neurons python q-learning qlearning reinforcement-learning
Last synced: 19 Mar 2025
https://github.com/adityajn105/flappy-bird-deep-q-learning
Flappy Bird Game trained on a Double Dueling Deep Q Network with Prioritized Experience Replay
cnn deep-learning dueling-dqn image-processing prioritized-experience-replay q-learning reinforcement-learning torch
Last synced: 29 Oct 2025
https://github.com/juliapomdp/tabulartdlearning.jl
Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
q-learning reinforcement-learning sarsa
Last synced: 23 Feb 2025
https://github.com/matin-ghorbani/snake-ai-deep-qlearning
Implement a snake AI with deep Q learning using PyTorch
deep-learning pytorch q-learning snake-ai
Last synced: 28 Feb 2026
https://github.com/aryaminus/srijana
A C++ and Opengl based game to draw a window leading to snake which runs on neural network to reach it's food within certain iteration for AI along with Keyboard press to move the Snake quads. Youtube:
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks cpp freeglut game graphics keyboard linux neural-network opengl perspective q-learning
Last synced: 23 Apr 2025
https://github.com/felipemarcelino/2048-ddqn-per-reinforcement-learning
This project aims apply Dueling Deep Q Learning with Prioritized experience to play game 2048
2048-game double-dqn dueling-q-networks machine-learning prioritized-experience-replay q-learning reinforcement-learning sumtree
Last synced: 12 Apr 2025
https://github.com/msoedov/q-learner
Open ai gym q-learning for SpaceInvaders
Last synced: 06 Sep 2025
https://github.com/chen0040/java-reinforcement-learning-tic-tac-toe
Demo of reinforcement learning using tic-tac-toe
q-learning reinforcement-learning sarsa tic-tac-toe
Last synced: 17 Jul 2025
https://github.com/farshidnooshi/pacman-projects
Principles & Applications of Artificial Intelligence at Amirkabir University of Technology course projects
bayes-network constraint-satisfaction-problem markov-decision-processes mdp pacman-projects q-learning reinforcement-learning sampling search
Last synced: 04 Jul 2025
https://github.com/misaghsoltani/deepcubeai
Learning Discrete World Models for Heuristic Search
a-star-search deep-learning deep-q-network deep-reinforcement-learning deepcubea deepcubeai deepxube heuristic-search heuristic-search-algorithms model-based-reinforcement-learning planning q-learning q-star q-star-search reinforcement-learning representation-learning rlc2024 rubiks-cube sokoban world-model
Last synced: 02 Jul 2025
https://github.com/jihoonerd/rl-maze
Simple maze solver by reinforcement learning
maze policy-gradient q-learning reinforce reinforcement-learning sarsa tutorial
Last synced: 12 Apr 2025
https://github.com/timhanewich/tetris-ai-mini
Training a neural network (AI) to play a very simplified game of 4x4 Tetris using Q-Learning.
ai keras keras-tensorflow machine-learning python q-learning reinforcement-learning tensorflow tetris
Last synced: 11 Oct 2025
https://github.com/sbatururimi/deeplearning
A reference folder mapping some of my deep-learning repositories: RNN, GAN, DCGAN, seq2seq, Transfer learning, Autoencoders, Semi-supervised learning
anaconda autoencoder dcgan deep-learning gan jupyter-notebook neural-network notebook python q-learning rnn semi-supervised-learning transfer-learning
Last synced: 15 Apr 2025
https://github.com/addy1997/rl-algorithms
This repository has RL algorithms implemented using python
double-expected-sarsa double-sarsa epsilon-greedy-exploration expected-sarsa gradient-bandits hacktoberfest hacktoberfest2020 monte-carlo-methods q-learning q-learning-vs-sarsa reinforcement-learning rl-algorithms sarsa
Last synced: 12 Apr 2025
https://github.com/mehdishahbazi/dqn-cliff-walking-gymnasium
This repo implements Deep Q-Network (DQN) for solving the Cliff Walking v0 environment of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with the finest tuning.
cliff-walking-problem cliffwalking deep-learning deep-q-learning deep-q-network deep-reinforcement-learning dqn drl drl-pytorch gym gymnasium python pytorch q-learning reinforcement-learning
Last synced: 22 Apr 2025
https://github.com/thomas-bouvier/floppy-bird
Flappy Bird-like game including a Q-learning algorithm and a neural network-based algorithm (NEAT) for artificial intelligence
artificial-intelligence evolutionary-algorithm flappy-bird genetic-algorithm neat neural-network q-learning
Last synced: 14 Aug 2025
https://github.com/yashbansod/cmsc_818b_decision_making_in_robotics
Python implementation of projects for Decision making in robotics
decision-making gaussian-processes neural-network planning python q-learning reinforcement-learning robotics tsp
Last synced: 13 May 2025
https://github.com/mehdishahbazi/dqn-mountain-car-gymnasium
This repo implements Deep Q-Network (DQN) for solving the Mountain Car v0 environment (discrete version) of the Gymnasium library using Python 3.8 and PyTorch 2.0.1 with a custom reward function for faster convergence.
deep-learning deep-q-learning deep-q-network deep-reinforcement-learning discrete dqn drl drl-pytorch gym gymnasium mountain-car mountain-car-discrete mountain-car-problem python pytorch q-learning reinforcement-learning
Last synced: 11 Oct 2025
https://github.com/lhvy/taxi-v3-q-learning
A simple Q-learning implementation in OpenAI Gym's "Taxi-v3" environment.
gym machine-learning openai-gym python-3 python3 q-learning reinforcement-learning
Last synced: 25 Jun 2025
https://github.com/garethjns/reinforcement-learning-keras
Reinforcement learning algorithms implemented in Keras (tensorflow==2.3) and sklearn
dqn keras policy-gradient q-learning reinforcement-learning tensorflow2
Last synced: 10 Apr 2025
https://github.com/aryankeluskar/irina.minesweeper
AI that solves Minesweeper. Uses a probabilistic approach, scans the grid, trains a Reinforcement Learning model to take decisions based on the existing grid, and marks the grid with mines to complete the game.
minesweeper pyautogui python q-learning reinforcement-learning
Last synced: 08 Apr 2025
https://github.com/flakas/bubble-shooter-bot
Tensorflow and Q-Learning based Python bot for the Bubble Shooter flash game
bot bubble-shooter keras opencv python q-learning reinforcement-learning tensorflow
Last synced: 10 Apr 2025
https://github.com/nikolazubic/appliedgametheoryhomeworksolutions
Solutions for course: "Applied Game Theory" taken at University of Novi Sad - Faculty of Technical Sciences
applied-game-theory bellman-ford-algorithm blackjack cournot-competition evolutionary-game-theory game-theory instigation-game markov-decision-process minimax-algorithm monte-carlo-methods multi-armed-bandit q-learning sarsa-learning softmax softmax-policy temporal-difference-learning tic-tac-toe
Last synced: 26 Jan 2026
https://github.com/howl-anderson/q_learning_demo
Show how Q-learning works from scratch
gym-environment q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 11 Jul 2025
https://github.com/papachristoumarios/ntua-neuro-fuzzy-control
Neuro-Fuzzy Control Assignments for ECE NTUA 9th Semester Course "Neuro-Fuzzy Control and Applications" (ECE1374)
neural-control q-learning reinforcement-learning
Last synced: 02 Mar 2025
https://github.com/rauhul/cs446
Machine Learning UIUC SP 2018
expectation-maximization gaussian-mixture-models generative-adversarial-network k-means linear-regression logistic-regression multiclass-classification neural-network q-learning svm variational-autoencoder
Last synced: 09 Feb 2026
https://github.com/luizparreira/auto_trader
Q-learning trader
cryptocurrency machine-learning q-learning trading
Last synced: 09 Oct 2025
https://github.com/mauroluzzatto/learn-tictactoe-through-self-play
This repository contains a Q-Learning algorithm that learns to play TicTacToe through self-play.
gym gym-environment q-learning reinforcement-learning self-learning tic-tac-toe
Last synced: 28 Jan 2026
https://github.com/pegah-ardehkhani/shortest-path-using-reinforcement-learning
Solve the shortest path problem using Reinforcement Learning. This project applies RL techniques, such as Q-learning and SARSA(λ), to find optimal routes in a weighted graph, where the algorithm learns to navigate by receiving rewards based on edge distances.
q-learning reinforcement-learning reinforcement-learning-algorithms sarsa sarsa-lambda shortest-path
Last synced: 01 Apr 2025
https://github.com/stepantita/q-learning
a Python-based platformer infused with Q-Learning and dynamic level creation from simple JSON files.
epsilon-greedy game-ai machine-learning machine-learning-algorithms platformer-game python q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments reinforcement-learning-playground
Last synced: 17 Jan 2026
https://github.com/parissashahabi/game-playing-intelligent-agent
Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.
astar-algorithm bidirectional-search dijkstra markov-decision-processes mdp minimax pac-man-game path-planning q-learning reinforcement-learning rl value-iteration
Last synced: 22 Jan 2026
https://github.com/sapanz/udacity-deep-reinforcement-learning-solution
This repo will cover most of machine learning algorithms with coding examples.
artificial-intelligence artificial-intelligence-algorithms deep-learning machine-learning machine-learning-algorithms machinelearning machinelearning-python q-learning reinforcement-learning reinforcement-learning-algorithms
Last synced: 20 Aug 2025
https://github.com/amirreza81/artificial-intelligence
Solutions to practical assignments of Artificial Intelligence course (CE-417) at Sharif University of Technology
a-star a-star-algorithm adversarial-search ai alpha-beta-pruning artificial-intelligence bayesian-networks cryptarithmetic decision-trees deep-learning logistic-regression machine-learning minimax-algorithm pytorch q-learning q-learning-vs-sarsa rainforcement-learning rl simulated-annealing simulated-annealing-algorithm
Last synced: 16 Jun 2025
https://github.com/mauroluzzatto/q-learning-demo-play-nchain
This repository contains a Jupyter Notebook with an implemenation of a Q-Learning Agent, which learns to solve the n-Chain OpenAI Gym environment
demo gym jupyter-notebook openai-gym python q-learning q-learning-algorithm reinforcement-learning
Last synced: 26 Apr 2025
https://github.com/sxv357/inspirit-ai-deep-dive-designing-dl-systems-finalproject-rl-for-autonomous-vehicles
This project uses Reinforcement Learning to teach an agent to drive by itself and learn from its observations so that it can maximize the reward(180+ lines)
deep-q-learning epsilon-greedy exploration-exploitation loss-functions q-learning reinforcement-learning
Last synced: 01 Mar 2025
https://github.com/yenicelik/spaceinvaders--v0
Playing around with OpenAI gym. Just experimenting, no serious competition intended.
ai atari deep-learning openai-gym q-learning reinforcement-learning spaceinvaders
Last synced: 17 Feb 2026