An open API service indexing awesome lists of open source software.

https://github.com/kirillbobyrev/td-gamma

Empirical Study of TD(γ) Reinforcement Learning algorithm for Value function Estimation
https://github.com/kirillbobyrev/td-gamma

class-project optimization poster reinforcement-learning

Last synced: 3 months ago
JSON representation

Empirical Study of TD(γ) Reinforcement Learning algorithm for Value function Estimation

Awesome Lists containing this project

README

          

# TD(γ)

Empirical Study of **TD(γ)** Reinforcement Learning algorithm for Value
function Estimation.

The studied algorithm was introduced in [TDγ: Re-evaluating Complex Backups in
Temporal Difference
Learning](https://papers.nips.cc/paper/4472-td_gamma-re-evaluating-complex-backups-in-temporal-difference-learning.pdf)
paper by G. Konidaris, S. Niekum and P. Thomas which was presented at NIPS
2011.

Obtained results show that **TD(γ)** was able to outperform its precursors
given a fairly simple environment and policy, but was much more
resource-intensive in terms of memory and computational complexity.

* [Compiled poster](https://kirillbobyrev.github.io/resources/TD-Gamma-Poster.pdf)
* [Code](https://colab.research.google.com/gist/kirillbobyrev/90efb074a4740f8a8a6abf22e58d988a/td-gamma.ipynb)