https://github.com/silviatulli/rlhomework

multi-armed bandit, gambler problem, cliff problem and TD learning


## Reinforcement Learning Homework

These exercises are part of the coursework for the [Reinforcement Learning](https://fenix.tecnico.ulisboa.pt/disciplinas/TAIA/2019-2020/1-semestre/sumarios) course.
The repository contains implementations of the multi-armed bandit, the gambler's problem, the cliff problem, and TD learning.
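
As a rough illustration of the first topic, here is a minimal epsilon-greedy sketch for a multi-armed bandit. It is not taken from this repository: the function name `run_epsilon_greedy`, its parameters, and the Gaussian reward model with unit variance are all assumptions made for the example.

```python
import random


def run_epsilon_greedy(true_means, steps=1000, epsilon=0.1, seed=0):
    """Hypothetical helper: play a Gaussian bandit with epsilon-greedy.

    true_means: assumed mean reward of each arm (unit-variance noise).
    Returns the value estimates, pull counts, and total reward collected.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of times each arm was pulled
    estimates = [0.0] * n_arms   # sample-average estimate of each arm's value
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental sample-average update: Q <- Q + (R - Q) / N.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = run_epsilon_greedy([0.2, 0.5, 1.0], steps=2000)
    print("estimates:", [round(q, 2) for q in estimates])
    print("pull counts:", counts)
```

The incremental update avoids storing past rewards; a constant step size could be swapped in for non-stationary arms.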

## Reference Book
[Reinforcement Learning: An Introduction](https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf) (2nd edition) by Richard S. Sutton and Andrew G. Barto