An open API service indexing awesome lists of open source software.

https://github.com/philipshurpik/reinforce-experiments

Simple implementations of vanilla REINFORCE (policy gradient) and actor-critic methods in NumPy and other frameworks

cartpole deep-learning machine-learning openai-gym pytorch reinforcement-learning tensorflow

Last synced: 3 months ago
