https://github.com/philipshurpik/reinforce-experiments
Simple implementations of vanilla REINFORCE (policy gradient) and actor-critic methods with NumPy and different frameworks
- Host: GitHub
- URL: https://github.com/philipshurpik/reinforce-experiments
- Owner: philipshurpik
- Created: 2018-01-23T23:21:05.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-02-05T19:17:37.000Z (over 6 years ago)
- Last Synced: 2024-03-15T10:01:17.234Z (over 1 year ago)
- Topics: cartpole, deep-learning, machine-learning, openai-gym, pytorch, reinforcement-learning, tensorflow
- Language: Python
- Size: 20.5 KB
- Stars: 2
- Watchers: 1
- Forks: 3
- Open Issues: 0