{"id":13689251,"url":"https://github.com/rlcode/reinforcement-learning","last_synced_at":"2025-10-03T15:57:01.612Z","repository":{"id":15828598,"uuid":"78835091","full_name":"rlcode/reinforcement-learning","owner":"rlcode","description":"Minimal and Clean Reinforcement Learning Examples","archived":false,"fork":false,"pushed_at":"2023-03-24T21:57:34.000Z","size":63088,"stargazers_count":3534,"open_issues_count":36,"forks_count":738,"subscribers_count":127,"default_branch":"master","last_synced_at":"2025-05-28T22:52:38.330Z","etag":null,"topics":["a3c","actor-critic","deep-learning","deep-q-network","deep-reinforcement-learning","dqn","machine-learning","policy-gradient","reinforcement-learning"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/rlcode.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2017-01-13T09:30:09.000Z","updated_at":"2025-05-28T02:53:41.000Z","dependencies_parsed_at":"2024-01-07T00:07:32.347Z","dependency_job_id":null,"html_url":"https://github.com/rlcode/reinforcement-learning","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/rlcode/reinforcement-learning","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rlcode%2Freinforcement-learning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rlcode%2Freinforcement-learning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rlcode%2Freinforcement-learning/releases","manifests_url":"https://repos.ecosyste.ms/api
/v1/hosts/GitHub/repositories/rlcode%2Freinforcement-learning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/rlcode","download_url":"https://codeload.github.com/rlcode/reinforcement-learning/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rlcode%2Freinforcement-learning/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278187830,"owners_count":25944822,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-03T02:00:06.070Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["a3c","actor-critic","deep-learning","deep-q-network","deep-reinforcement-learning","dqn","machine-learning","policy-gradient","reinforcement-learning"],"created_at":"2024-08-02T15:01:40.114Z","updated_at":"2025-10-03T15:57:00.941Z","avatar_url":"https://github.com/rlcode.png","language":"Python","funding_links":[],"categories":["Python","Table of Contents","Codes","🤖 AI \u0026 Machine Learning"],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\u003cimg width=\"90%\" src=\"images/Reinforcement-Learning.png\"\u003e\u003c/p\u003e\n\n--------------------------------------------------------------------------------\n\n\u003e Minimal and clean examples of reinforcement learning algorithms presented by [RLCode](https://rlcode.github.io) team. 
[[한국어]](https://github.com/rlcode/reinforcement-learning-kr)\n\u003e\n\u003e Maintainers - [Woongwon](https://github.com/dnddnjs), [Youngmoo](https://github.com/zzing0907), [Hyeokreal](https://github.com/Hyeokreal), [Uiryeong](https://github.com/wooridle), [Keon](https://github.com/keon)\n\nFrom the basics to deep reinforcement learning, this repo provides easy-to-read code examples. One file for each algorithm.\nPlease feel free to create a [Pull Request](https://github.com/rlcode/reinforcement-learning/pulls) or open an [issue](https://github.com/rlcode/reinforcement-learning/issues)!\n\n## Dependencies\n1. Python 3.5\n2. TensorFlow 1.0.0\n3. Keras\n4. numpy\n5. pandas\n6. matplotlib\n7. pillow\n8. scikit-image\n9. h5py\n\n### Install Requirements\n```\npip install -r requirements.txt\n```\n\n## Table of Contents\n\n**Grid World** - Mastering the basics of reinforcement learning in a simplified world called \"Grid World\"\n\n- [Policy Iteration](./1-grid-world/1-policy-iteration)\n- [Value Iteration](./1-grid-world/2-value-iteration)\n- [Monte Carlo](./1-grid-world/3-monte-carlo)\n- [SARSA](./1-grid-world/4-sarsa)\n- [Q-Learning](./1-grid-world/5-q-learning)\n- [Deep SARSA](./1-grid-world/6-deep-sarsa)\n- [REINFORCE](./1-grid-world/7-reinforce)\n\n**CartPole** - Applying deep reinforcement learning to the basic CartPole game\n\n- [Deep Q Network](./2-cartpole/1-dqn)\n- [Double Deep Q Network](./2-cartpole/2-double-dqn)\n- [Policy Gradient](./2-cartpole/3-reinforce)\n- [Actor Critic (A2C)](./2-cartpole/4-actor-critic)\n- [Asynchronous Advantage Actor Critic (A3C)](./2-cartpole/5-a3c)\n\n**Atari** - Mastering Atari games with Deep Reinforcement Learning\n\n- **Breakout** - [DQN](./3-atari/1-breakout/breakout_dqn.py), [DDQN](./3-atari/1-breakout/breakout_ddqn.py), [Dueling DDQN](./3-atari/1-breakout/breakout_ddqn.py), [A3C](./3-atari/1-breakout/breakout_a3c.py)\n- **Pong** - [Policy Gradient](./3-atari/2-pong/pong_reinforce.py)\n\n**OpenAI GYM** - [WIP]\n\n- Mountain Car - 
[DQN](./4-gym/1-mountaincar)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frlcode%2Freinforcement-learning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frlcode%2Freinforcement-learning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frlcode%2Freinforcement-learning/lists"}