An open API service indexing awesome lists of open source software.

https://github.com/qasimwani/td-learning-openai

Teaching an RL agent to perform a task using Temporal Difference control algorithms.
https://github.com/qasimwani/td-learning-openai

Last synced: 3 months ago
JSON representation

Teaching an RL agent to perform a task using Temporal Difference control algorithms.

Awesome Lists containing this project

README

          

# Temporal Difference Learning (RL)
Teaching an RL agent to perform a task using Temporal Difference control algorithms.

Implemented Sarsa, Q-learning (Sarsa max), & expected sarsa from scratch to teach an RL agent (Taxi-v3) to complete an episode with the maximum expected reward by estimating the optimal policy π.

Gridworld:

`+---------+`

`|R: | : :G|`

`| : | : : |`

`| : : : : |`

`| | : | : |`

`|Y| : |B: |`

`+---------+`