https://github.com/tirthajyoti/rl_basics
Basic Reinforcement Learning algorithms
https://github.com/tirthajyoti/rl_basics
artificial-intelligence machine-learning machine-learning-algorithms policy-iteration q-learning reinforcement-learning td-learning temporal-differencing-learning value-iteration
Last synced: about 2 months ago
JSON representation
Basic Reinforcement Learning algorithms
- Host: GitHub
- URL: https://github.com/tirthajyoti/rl_basics
- Owner: tirthajyoti
- License: mit
- Created: 2018-11-13T06:18:54.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-06-06T07:26:07.000Z (about 6 years ago)
- Last Synced: 2025-02-25T18:45:21.703Z (4 months ago)
- Topics: artificial-intelligence, machine-learning, machine-learning-algorithms, policy-iteration, q-learning, reinforcement-learning, td-learning, temporal-differencing-learning, value-iteration
- Language: Jupyter Notebook
- Size: 2.29 MB
- Stars: 18
- Watchers: 4
- Forks: 13
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Reinforcement Learning Basics
[](https://mybinder.org/v2/gh/tirthajyoti/RL_basics/master)### What is reinforcement learning?
Reinforcement Learning(RL) is a type of machine learning technique that enables an agent to learn in an interactive environment by trial and error using feedback from its own actions and experiences.### Dynamic visualization of the value iteration/utility propagation in a grid world Markov Decision Process

### [Basics of Markov Decision Process](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_basics_value_iteration.ipynb)
### [Value iteration](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### [Policy iteration](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### [Q-learning](https://github.com/tirthajyoti/RL_basics/blob/master/MDP_VI_PI_Q-learning_AIMA.ipynb)
### TD-learning