https://github.com/instance01/rtdp

Real-time Dynamic Programming (RTDP)

Host: GitHub
URL: https://github.com/instance01/rtdp
Owner: instance01
Created: 2020-06-27T11:58:45.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2020-06-27T13:10:36.000Z (over 5 years ago)
Last Synced: 2025-04-02T14:21:34.994Z (10 months ago)
Topics: paper-implementations, realtime-dynamic-programming, rtdp
Language: Python
Homepage:
Size: 2.93 KB
Stars: 2
Watchers: 2
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: README.md


## Real-time dynamic programming

Real-time dynamic programming (RTDP) samples trajectories through the state space by following the current greedy policy and updates the state values along the way.
It is an efficient approach to real-time planning because it does not necessarily visit the whole state space, and it works well in stochastic environments.
This implementation assumes full observability.
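
To make the idea concrete, here is a minimal sketch of a single RTDP trial. It is not the code from this repository. The names `q_value`, `rtdp_trial` and `CHAIN` are hypothetical, and the model layout (`P[state][action]` as a list of `(probability, next_state, reward, done)` tuples) is an assumption chosen to resemble the transition table of Gym's Frozen Lake. The sketch also assumes rewards are maximized and that `V` starts at an optimistic upper bound (for example 1.0 when the only reward is 1 at the goal).

```python
import random

# Hypothetical 2-state chain: action 1 moves right with probability 0.8,
# action 0 moves left. Reaching the goal from state 1 gives reward 1.
CHAIN = {
    0: {0: [(1.0, 0, 0.0, False)],
        1: [(0.8, 1, 0.0, False), (0.2, 0, 0.0, False)]},
    1: {0: [(1.0, 0, 0.0, False)],
        1: [(0.8, 2, 1.0, True), (0.2, 1, 0.0, False)]},
}


def q_value(P, V, s, a, gamma):
    """Expected one-step return of action a in state s under values V."""
    return sum(p * (r + (0.0 if done else gamma * V[s2]))
               for p, s2, r, done in P[s][a])


def rtdp_trial(P, V, start, gamma=0.95, max_steps=100, rng=random):
    """Run one trial: follow the greedy policy and back up visited states."""
    s = start
    for _ in range(max_steps):
        qs = {a: q_value(P, V, s, a, gamma) for a in P[s]}
        best = max(qs.values())
        V[s] = best  # Bellman backup on the visited state only
        # Greedy action, ties broken at random
        a = rng.choice([act for act, q in qs.items() if q == best])
        # Sample the successor from the model
        outcomes = P[s][a]
        _, s2, _, done = rng.choices(
            outcomes, weights=[o[0] for o in outcomes])[0]
        if done:
            break
        s = s2
    return V
```

Calling `rtdp_trial(CHAIN, V, start=0)` repeatedly with `V = {0: 1.0, 1: 1.0}` only ever touches states that the greedy policy actually reaches, which is what keeps RTDP cheap on large maps.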

One such stochastic environment is the Frozen Lake environment.
In this repository RTDP is applied to a 20x20 map of said environment.
After a few minutes of training, an average reward of 0.48 over 10000 evaluation episodes can be observed.
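
An average reward like this can be estimated by rolling out the greedy policy many times. The following is a hedged sketch rather than the evaluation code used here: `greedy_action` and `evaluate` are hypothetical names, and the model is assumed to be a dict `P[state][action]` holding `(probability, next_state, reward, done)` tuples.

```python
import random


def greedy_action(P, V, s, gamma):
    """Pick the action with the highest expected one-step return."""
    def q(a):
        return sum(p * (r + (0.0 if done else gamma * V[s2]))
                   for p, s2, r, done in P[s][a])
    return max(P[s], key=q)


def evaluate(P, V, start, episodes=10000, gamma=0.95,
             max_steps=200, rng=random):
    """Average undiscounted reward per episode under the greedy policy."""
    total = 0.0
    for _ in range(episodes):
        s = start
        for _ in range(max_steps):
            a = greedy_action(P, V, s, gamma)
            outcomes = P[s][a]
            # Sample one outcome according to its probability
            _, s2, r, done = rng.choices(
                outcomes, weights=[o[0] for o in outcomes])[0]
            total += r
            if done:
                break
            s = s2
    return total / episodes
```

On Frozen Lake the only reward is 1 for reaching the goal, so the returned average can be read as the fraction of episodes that reach the goal within `max_steps`.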

For more information on RTDP, refer to:
* Learning to act using real-time dynamic programming, 1994, Barto et al.
* Planning with Markov Decision Processes: An AI Perspective, 2012, Mausam and Kolobov

Accompanying blog post: [RTDP applied to Frozen Lake](https://blog.xa0.de/post/Real---time-Dynamic-Programming%20%28RTDP%29-applied-to-Frozen-Lake/)
