https://github.com/antonio-f/dynamic-programming

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
https://github.com/antonio-f/dynamic-programming

action-value-function bellman-equation dynamic-programming frozenlake gym openai-gym policy-evaluation policy-improvement policy-iteration reinforcement-learning state-value-function value-iteration

Last synced: over 1 year ago
JSON representation

Host: GitHub
URL: https://github.com/antonio-f/dynamic-programming
Owner: antonio-f
Created: 2019-04-03T20:44:36.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2019-04-03T20:46:29.000Z (over 7 years ago)
Last Synced: 2025-03-27T15:21:24.442Z (over 1 year ago)
Topics: action-value-function, bellman-equation, dynamic-programming, frozenlake, gym, openai-gym, policy-evaluation, policy-improvement, policy-iteration, reinforcement-learning, state-value-function, value-iteration
Language: Jupyter Notebook
Size: 179 KB
Stars: 12
Watchers: 1
Forks: 4
Open Issues: 0

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/antonio-f/dynamic-programming

Awesome Lists containing this project