https://github.com/nikhilbarhate99/actor-critic-pytorch
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
https://github.com/nikhilbarhate99/actor-critic-pytorch
a2c actor-critic deep-reinforcement-learning openai-gym openai-gym-environments policy-gradient pytorch pytorch-implmention pytorch-tutorial reinforcement-learning-algorithms
Last synced: 3 months ago
JSON representation
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
- Host: GitHub
- URL: https://github.com/nikhilbarhate99/actor-critic-pytorch
- Owner: nikhilbarhate99
- License: bsd-3-clause
- Created: 2018-09-03T11:34:40.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2019-05-07T07:50:26.000Z (over 6 years ago)
- Last Synced: 2025-04-08T07:42:59.824Z (6 months ago)
- Topics: a2c, actor-critic, deep-reinforcement-learning, openai-gym, openai-gym-environments, policy-gradient, pytorch, pytorch-implmention, pytorch-tutorial, reinforcement-learning-algorithms
- Language: Python
- Homepage:
- Size: 2.1 MB
- Stars: 72
- Watchers: 0
- Forks: 26
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Actor-Critic
Solution for Lunar Lander environment v2 of Open AI gym.
The algorithm used is actor-critic (vanilla policy gradient with baseline),more info : http://rail.eecs.berkeley.edu/deeprlcourse-fa17/f17docs/lecture_5_actor_critic_pdf.pdf
-> Dependencies:
OpenAI gym
PyTorch 0.4.1
PIL
-> Hyperparameters can be changed by editing them in respective files
-> To train : run train.py
-> Converges within 1500 episodes
-> To test a pretrained model : run test.py
