https://github.com/tirthajyoti/rl_basics

Basic Reinforcement Learning algorithms
artificial-intelligence machine-learning machine-learning-algorithms policy-iteration q-learning reinforcement-learning td-learning temporal-differencing-learning value-iteration

# Reinforcement Learning Basics
[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/tirthajyoti/RL_basics/master)

### What is reinforcement learning?
Reinforcement learning (RL) is a machine learning technique in which an agent learns in an interactive environment by trial and error, using feedback from its own actions and experiences.

### Dynamic visualization of value iteration (utility propagation) in a grid-world Markov Decision Process
![VI evolution grid world](https://github.com/tirthajyoti/RL_basics/blob/master/Images/Value%20iteration%20visualization%20for%20grid%20world.gif)
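
The utility propagation shown in the animation can be approximated with a few lines of NumPy. This is only a minimal sketch, not the code from the notebooks: the function name `value_iteration`, the deterministic moves, the step reward of `-0.04`, and the 3x4 layout with two terminal cells are all assumptions made for illustration.

```python
import numpy as np

def value_iteration(n_rows, n_cols, terminals, step_reward=-0.04, gamma=0.9, tol=1e-6):
    """Hypothetical helper: value iteration on a deterministic grid world.

    terminals maps (row, col) -> fixed utility of that terminal cell.
    """
    actions = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
    V = np.zeros((n_rows, n_cols))
    for state, reward in terminals.items():
        V[state] = reward

    while True:
        delta = 0.0
        new_V = V.copy()
        for r in range(n_rows):
            for c in range(n_cols):
                if (r, c) in terminals:
                    continue  # terminal utilities stay fixed
                best = -np.inf
                for dr, dc in actions:
                    nr, nc = r + dr, c + dc
                    # Assumption: moving into a wall leaves the agent in place.
                    if not (0 <= nr < n_rows and 0 <= nc < n_cols):
                        nr, nc = r, c
                    # Bellman backup for a deterministic transition.
                    best = max(best, step_reward + gamma * V[nr, nc])
                new_V[r, c] = best
                delta = max(delta, abs(best - V[r, c]))
        V = new_V
        if delta < tol:
            return V

# Example: 3x4 grid with a +1 and a -1 terminal in the right-hand column.
utilities = value_iteration(3, 4, {(0, 3): 1.0, (1, 3): -1.0})
print(np.round(utilities, 3))
```

Each sweep pushes the terminal utilities one cell further into the grid, which is the propagation effect the animation illustrates. A stochastic transition model would replace the single `V[nr, nc]` term with an expectation over next states.
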
### [Basics of Markov Decision Process](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_basics_value_iteration.ipynb)
### [Value iteration](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### [Policy iteration](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### [Q-learning](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### TD-learning
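
As a rough illustration of the idea, tabular TD(0) updates a state's value estimate toward the observed reward plus the discounted estimate of the next state. The sketch below is not taken from this repository: the function name `td0_random_walk`, the five-state random-walk environment, and all parameter values are assumptions chosen to keep the example short.

```python
import random

def td0_random_walk(n_states=5, episodes=5000, alpha=0.05, gamma=1.0, seed=0):
    """Hypothetical helper: TD(0) value estimation for a uniform random policy.

    States 1..n_states are non-terminal; 0 and n_states + 1 are terminal.
    Reaching the right terminal yields reward 1, everything else yields 0.
    """
    rng = random.Random(seed)
    V = [0.0] * (n_states + 2)  # terminal values stay at 0
    for _ in range(episodes):
        s = (n_states + 1) // 2  # start each episode in the middle
        while 0 < s < n_states + 1:
            s_next = s + rng.choice((-1, 1))  # step left or right at random
            reward = 1.0 if s_next == n_states + 1 else 0.0
            # TD(0) update: move V[s] toward the one-step bootstrapped target.
            V[s] += alpha * (reward + gamma * V[s_next] - V[s])
            s = s_next
    return V[1:-1]  # estimates for the non-terminal states only

estimates = td0_random_walk()
print([round(v, 2) for v in estimates])
```

For this environment the true values are `i / (n_states + 1)` for state `i`, so the estimates should settle near 1/6, 2/6, and so on, with some noise because the step size `alpha` is held constant.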