https://github.com/antonio-f/td-methods-sarsa
Temporal Difference methods - A simple implementation of SARSA algorithm applied to OpenAI gym's "CliffWalking" environment.
https://github.com/antonio-f/td-methods-sarsa
101 algorithm cliffwalking gym gym-environment machine-learning openai-gym reinforcement-learning sarsa sarsa-algorithm simple td-methods temporal-difference
Last synced: 8 months ago
JSON representation
Temporal Difference methods - A simple implementation of SARSA algorithm applied to OpenAI gym's "CliffWalking" environment.
- Host: GitHub
- URL: https://github.com/antonio-f/td-methods-sarsa
- Owner: antonio-f
- Created: 2019-07-08T20:49:54.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-07-10T07:15:51.000Z (over 6 years ago)
- Last Synced: 2025-02-06T04:46:31.738Z (10 months ago)
- Topics: 101, algorithm, cliffwalking, gym, gym-environment, machine-learning, openai-gym, reinforcement-learning, sarsa, sarsa-algorithm, simple, td-methods, temporal-difference
- Language: Jupyter Notebook
- Size: 248 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0