https://github.com/antonio-f/dynamic-programming
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
https://github.com/antonio-f/dynamic-programming
action-value-function bellman-equation dynamic-programming frozenlake gym openai-gym policy-evaluation policy-improvement policy-iteration reinforcement-learning state-value-function value-iteration
Last synced: about 1 month ago
JSON representation
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.
- Host: GitHub
- URL: https://github.com/antonio-f/dynamic-programming
- Owner: antonio-f
- Created: 2019-04-03T20:44:36.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2019-04-03T20:46:29.000Z (about 6 years ago)
- Last Synced: 2025-03-27T15:21:24.442Z (about 2 months ago)
- Topics: action-value-function, bellman-equation, dynamic-programming, frozenlake, gym, openai-gym, policy-evaluation, policy-improvement, policy-iteration, reinforcement-learning, state-value-function, value-iteration
- Language: Jupyter Notebook
- Size: 179 KB
- Stars: 12
- Watchers: 1
- Forks: 4
- Open Issues: 0