{"id":19068728,"url":"https://github.com/machine-learning-tokyo/reinforcement_learning","last_synced_at":"2026-03-08T23:31:54.139Z","repository":{"id":96775601,"uuid":"198333954","full_name":"Machine-Learning-Tokyo/Reinforcement_Learning","owner":"Machine-Learning-Tokyo","description":"Material for MLT Reinforcement Learning workshops and study sessions","archived":false,"fork":false,"pushed_at":"2020-06-20T15:38:37.000Z","size":5251,"stargazers_count":51,"open_issues_count":0,"forks_count":9,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-04-18T16:26:40.079Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Machine-Learning-Tokyo.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-07-23T02:12:10.000Z","updated_at":"2025-03-21T16:08:38.000Z","dependencies_parsed_at":"2024-02-22T04:15:21.110Z","dependency_job_id":null,"html_url":"https://github.com/Machine-Learning-Tokyo/Reinforcement_Learning","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FReinforcement_Learning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FReinforcement_Learning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FReinforcement_Learning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FReinforcement_Learning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Machine-Learning-Tokyo","download_url":"https://codeload.github.com/Machine-Learning-Tokyo/Reinforcement_Learning/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251321213,"owners_count":21570696,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-09T01:11:34.572Z","updated_at":"2026-03-08T23:31:49.107Z","avatar_url":"https://github.com/Machine-Learning-Tokyo.png","language":"Jupyter Notebook","readme":"# Reinforcement_Learning\nMaterial for MLT Reinforcement Learning workshops and study sessions.\n\nAlso, check out our [MLT repo](https://github.com/Machine-Learning-Tokyo/Deep_Reinforcement_Learning) with top Deep RL resources (tutorials, code, books).\n\n\n# RL Interactive Tools\n\n1. Îµ Decay\n2. k-Armed Bandit\n3. Exploration vs Explotation\n\n- Original concept and Python code: [Anugraha Sinha](https://twitter.com/anugrahasinha)\n- Javascript implementation: [Francisco Dalla Rosa Soares](https://twitter.com/dallarosajp)\n\n\n# Intro to Reinforcement Learning – Session #1\n\nby [Anugraha Sinha](https://twitter.com/anugrahasinha)\n\n### [[Meetup]](https://www.meetup.com/Machine-Learning-Tokyo/events/263347323/) \u0026 [[Slides and Code]](https://github.com/Machine-Learning-Tokyo/Reinforcement_Learning/tree/master/session%20%231)\n\n\nPresentation\n1. Introduction to RL\n2. Important elements of an RL problem\n3. Description of Markov Decision Process (MDP) and and Markov Assumption.\n4. Importance of parametrization of State, Action, Reward and Environment.\n5. Model Based and Model Free Methods\n6. Meaning of Control Problem and Evaluation Problem.\n7. Algorithm of Policy Evaluation and Value iteration methods\n\nCode examples\n1. Finding the best route through a maze/obstruction avoidance using policy iteration algorithm.\n2. Above problem statement with value iterations algorithm.\n3. Code exercise\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachine-learning-tokyo%2Freinforcement_learning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmachine-learning-tokyo%2Freinforcement_learning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachine-learning-tokyo%2Freinforcement_learning/lists"}