{"id":24512413,"url":"https://github.com/weiyx16/rl_toy_example","last_synced_at":"2025-03-15T10:10:56.297Z","repository":{"id":134507992,"uuid":"227798567","full_name":"weiyx16/RL_Toy_Example","owner":"weiyx16","description":"Deep RL toy example based on gym package with several methods","archived":false,"fork":false,"pushed_at":"2020-01-15T09:17:07.000Z","size":1081,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"release","last_synced_at":"2025-01-22T00:46:15.874Z","etag":null,"topics":["a2c","a3c","continuous-mountain-car","dqn-tensorflow","gym","mountain-car","qlearning","sarsa","sarsa-lambda"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/weiyx16.png","metadata":{"files":{"readme":"Readme.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-12-13T09:03:04.000Z","updated_at":"2023-05-25T11:10:03.000Z","dependencies_parsed_at":"2023-06-16T19:23:35.330Z","dependency_job_id":null,"html_url":"https://github.com/weiyx16/RL_Toy_Example","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/weiyx16%2FRL_Toy_Example","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/weiyx16%2FRL_Toy_Example/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/weiyx16%2FRL_Toy_Example/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/weiyx16%2FRL_Toy_Example/manifests","owner_url":"https:/
/repos.ecosyste.ms/api/v1/hosts/GitHub/owners/weiyx16","download_url":"https://codeload.github.com/weiyx16/RL_Toy_Example/tar.gz/refs/heads/release","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243713415,"owners_count":20335567,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["a2c","a3c","continuous-mountain-car","dqn-tensorflow","gym","mountain-car","qlearning","sarsa","sarsa-lambda"],"created_at":"2025-01-22T00:46:31.175Z","updated_at":"2025-03-15T10:10:56.272Z","avatar_url":"https://github.com/weiyx16.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Mountain car RL task\n\nThis repository compares several RL methods on two [gym](https://github.com/openai/gym) environments: [Mountain car v0](https://gym.openai.com/envs/MountainCar-v0) and [Continuous mountain car v0](https://gym.openai.com/envs/MountainCarContinuous-v0)  \n\n## Env\n\n+ Ubuntu  \n+ Tensorflow 1.x (tested on 1.12.0)  \n+ Gym (tested on 0.15.4)\n+ Numpy (tested on 1.17.2)  \n+ Matplotlib (tested on 3.1.1)\n\n```sh\n# Or, if you use Anaconda, create the environment directly:\n$ conda env create -f environment.yml\n```\n\n## Usage\n\n**Note:** to render the video, set the following at the beginning of each file:\n```python\nRENDER = True  # Show GUI\n```\n\n### Mountain Car v0\n\nResult: DQN\u003eSarsa-lambda\u003eSarsa~QTable \n\n```sh\n$ cd MountainCar\n# Run QTable\n$ python QTable_MountainCar.py\n# Run Sarsa\n$ python Sarsa_MountainCar.py\n# Run Sarsa-lambda\n$ python Sarsalambda_MountainCar.py\n# Run DQN\n$ python DQN_MountainCar.py\n```\n\n### Continuous 
Mountain Car v0\n\nResult: A2C\u003eA3C\n\n```sh\n$ cd MountainCar_Continuous\n# Run A2C\n$ python A2C_MountainCar.py\n# Run A3C\n$ python A3C_MountainCar.py\n```\n\n## Citation\n\n```bibtex\n@misc{RLYixuan2020, \n    author = {Yixuan, Wei},\n    title = {Mountain car RL task},\n    howpublished = {\\url{https://github.com/weiyx16/RL_Toy_Example}},\n    year = {2020}\n}\n```\n\n## Thanks\n\n+ https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow  \n+ https://towardsdatascience.com/reinforcement-learning-w-keras-openai-dqns-1eed3a5338c  \n+ https://github.com/stefanbo92/A3C-Continuous  \n+ https://github.com/sudharsan13296/Hands-On-Reinforcement-Learning-With-Python  \n+ https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mountain-car-in-tensorflow-4c1fb2110f7c  \n+ https://towardsdatascience.com/understanding-actor-critic-methods-931b97b6df3f  ","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fweiyx16%2Frl_toy_example","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fweiyx16%2Frl_toy_example","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fweiyx16%2Frl_toy_example/lists"}