https://github.com/adeelh/rl_high_confidence_policy_improvement

High Confidence Policy Improvement for RL (class project for Reinforcement Learning, Fall 2019)

machine-learning optimization reinforcement-learning

# High Confidence Policy Improvement for RL
Given only data from previous executions of a policy, and not the policy itself, generate a new policy that is guaranteed, with high probability, to perform better than the old policy.

For details: https://people.cs.umass.edu/~pthomas/papers/Thomas2015b.pdf
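
The "high probability" guarantee described above can be illustrated with a safety test: estimate the new policy's return from the old data, compute a lower confidence bound on that estimate, and accept the new policy only if the bound beats the old policy's performance. The following is a minimal sketch under stated assumptions, not this project's code. It assumes each logged step also records the probability with which the old policy chose its action (`behavior_prob`, a hypothetical field), it uses ordinary importance sampling, and it swaps in a normal-approximation lower bound purely to stay within the standard library, so the guarantee it gives is only approximate. All function and parameter names are illustrative.

```python
import math
from statistics import NormalDist, mean, stdev


def importance_sampled_return(trajectory, candidate_prob, gamma=1.0):
    """Importance sampling estimate of the candidate policy's return.

    trajectory: list of (state, action, reward, behavior_prob) tuples, where
        behavior_prob is the logged probability of the action under the old
        policy (an assumption of this sketch).
    candidate_prob: function (state, action) -> probability under the
        candidate policy.
    """
    weight = 1.0      # product of per-step probability ratios
    ret = 0.0         # discounted return observed in the logged trajectory
    discount = 1.0
    for state, action, reward, behavior_prob in trajectory:
        weight *= candidate_prob(state, action) / behavior_prob
        ret += discount * reward
        discount *= gamma
    return weight * ret


def lower_bound(estimates, delta):
    """Approximate one-sided (1 - delta) lower bound on the mean estimate.

    Uses a normal approximation (assumed here for simplicity).
    """
    z = NormalDist().inv_cdf(1.0 - delta)
    return mean(estimates) - z * stdev(estimates) / math.sqrt(len(estimates))


def safety_test(safety_data, candidate_prob, baseline, delta=0.05, gamma=1.0):
    """Return True if the candidate's lower bound exceeds the baseline.

    baseline: performance of the old policy, e.g. its average logged return.
    """
    estimates = [
        importance_sampled_return(t, candidate_prob, gamma) for t in safety_data
    ]
    return lower_bound(estimates, delta) > baseline
```

In this sketch, a candidate policy would be returned only when `safety_test` passes on trajectories held out from the policy search; otherwise one would keep the old policy.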