https://github.com/philipshurpik/reinforce-experiments

Simple implementations of vanilla reinforce (policy gradient) and actor critic methods with numpy and different frameworks
https://github.com/philipshurpik/reinforce-experiments

cartpole deep-learning machine-learning openai-gym pytorch reinforcement-learning tensorflow

Last synced: 12 months ago
JSON representation

Simple implementations of vanilla reinforce (policy gradient) and actor critic methods with numpy and different frameworks

Host: GitHub
URL: https://github.com/philipshurpik/reinforce-experiments
Owner: philipshurpik
Created: 2018-01-23T23:21:05.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2019-02-05T19:17:37.000Z (over 7 years ago)
Last Synced: 2024-03-15T10:01:17.234Z (over 2 years ago)
Topics: cartpole, deep-learning, machine-learning, openai-gym, pytorch, reinforcement-learning, tensorflow
Language: Python
Size: 20.5 KB
Stars: 2
Watchers: 1
Forks: 3
Open Issues: 0

ecosyste.ms