{"id":15942024,"url":"https://github.com/amifunny/reinforce_adventure","last_synced_at":"2025-03-25T08:31:12.029Z","repository":{"id":112109043,"uuid":"266299350","full_name":"amifunny/Reinforce_Adventure","owner":"amifunny","description":"This Repository contains my implementation of popular algorithms on popular environments.","archived":false,"fork":false,"pushed_at":"2020-07-16T17:46:38.000Z","size":1238,"stargazers_count":5,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-19T22:53:11.359Z","etag":null,"topics":["actor-critic","bandit","ddpg","dqn","dqn-tensorflow","gym","openai","reinforcement-learning","reinforcement-learning-algorithms","rl"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/amifunny.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-05-23T08:57:54.000Z","updated_at":"2024-08-19T12:36:33.000Z","dependencies_parsed_at":"2023-04-08T21:57:52.044Z","dependency_job_id":null,"html_url":"https://github.com/amifunny/Reinforce_Adventure","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amifunny%2FReinforce_Adventure","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amifunny%2FReinforce_Adventure/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amifunny%2FReinforce_Adventure/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amifunny%2FReinforce_Adventure/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/amifunny","download_url":"https://codeload.github.com/amifunny/Reinforce_Adventure/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245426210,"owners_count":20613303,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["actor-critic","bandit","ddpg","dqn","dqn-tensorflow","gym","openai","reinforcement-learning","reinforcement-learning-algorithms","rl"],"created_at":"2024-10-07T07:21:56.350Z","updated_at":"2025-03-25T08:31:12.023Z","avatar_url":"https://github.com/amifunny.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n# Reinforce Adventure\nThis Repository contains my implementation of popular algorithms on popular environments.\n\nRepository contains code for - \n\n##  Inverted Pendulum Problem\n\n- [DDPG ( Deep Deterministic Policy Gradient )](https://github.com/amifunny/Reinforce_Adventure/blob/master/DDPG_Keras_Example_wtih_Pendulum.ipynb)\n\tAlso on [official keras-examples](https://keras.io/examples/rl/ddpg_pendulum/)\n\n![GIF for Pendulum](gifs/pendulum.gif)\n\n## Cartpole Problem\n\t\n- [Actor-Critic](https://github.com/amifunny/Reinforce_Adventure/blob/master/ACTOR_CRITIC.py)\n-\t[Monte Carlo Method](https://github.com/amifunny/Reinforce_Adventure/blob/master/Monte_Carlo_Method.py)\n-\t[PPO ( Proximal Policy Optimization )](https://github.com/amifunny/Reinforce_Adventure/blob/master/PPO_Algorithms.py)\n-\t[Q-Learning with Neural Net](https://github.com/amifunny/Reinforce_Adventure/blob/master/Q_Learning_CartPole.py)\n-\t[Vanilla Policy Gradient](https://github.com/amifunny/Reinforce_Adventure/blob/master/Vanilla_policy_Gradient.py)\n\n![GIF for Cartpole](gifs/cartpole.gif)\n\n## Lunar Lander\n\n- [Actor-Critic](https://github.com/amifunny/Reinforce_Adventure/blob/master/Moon_Lander_Discrete.py)\n \n![GIF for LunarLander](gifs/lunar_lander.gif)\n\n## Mountain Car\n\n  - [Q-Learning with Neural Net](https://github.com/amifunny/Reinforce_Adventure/blob/master/Q_Learning_Mountain_CAR.py)\n  \n   ![GIF for MountainCar](gifs/mountain.gif)\n   \n\n## Slot Machine Bandit problem\n\n- [E-greedy \u0026 Thompson Sampling](https://github.com/amifunny/Reinforce_Adventure/blob/master/Multi_Armed_Bandits.py)\n\n![Graph E-greedy vs Thompson](gifs/MultiArmedBandit.png)\n\n\n  \n  \n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famifunny%2Freinforce_adventure","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Famifunny%2Freinforce_adventure","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famifunny%2Freinforce_adventure/lists"}