An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with td-learning

A curated list of projects in awesome lists tagged with td-learning.

https://github.com/omerbsezer/Reinforcement_learning_tutorial_with_demo

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.

a3c actor-critic deep-reinforcement-learning dyna dynamic-programming imitation-learning machine-learning meta-learning policy-gradient pomdps q-learning reinforcement-learning sarsa td-learning tutorial

Last synced: 19 Jul 2025

https://github.com/wjaskowski/mastering-2048

An efficient reinforcement learning algorithm for learning a strategy for the game 2048

2048 ntuples reinforcement-learning reinforcement-learning-algorithms td-learning

Last synced: 12 Apr 2025

https://github.com/mobeets/value-rnn-td

Train an RNN to estimate value in a POMDP using TD learning

pomdp pytorch rnn td-learning

Last synced: 28 Oct 2025

https://github.com/k-karna/reinforcement_learning

Reinforcement Learning Specialization | University of Alberta

q-learning-vs-sarsa reinforcement-learning-algorithms td-learning

Last synced: 28 Jul 2025

https://github.com/silviatulli/rlhomework

Multi-armed bandit, gambler's problem, cliff problem, and TD learning

cliff-problem gambler-problem multi-armed-bandit sequential-decision-making-problems td-learning

Last synced: 01 May 2025

https://github.com/d-dawg78/mva_rl

Master MVA - Reinforcement Learning Project

actor-critic-algorithm echolocation td-learning

Last synced: 04 Apr 2025

https://github.com/dyth/juno

Tic-Tac-Toe agent trained by Deep Reinforcement Learning

deep-reinforcement-learning reinforcement-learning td-lambda td-learning value-network

Last synced: 13 May 2026