{"id":22938031,"url":"https://github.com/notthestallion/rl_models","last_synced_at":"2025-04-01T19:32:48.055Z","repository":{"id":267179989,"uuid":"897240381","full_name":"NotTheStallion/RL_models","owner":"NotTheStallion","description":null,"archived":false,"fork":false,"pushed_at":"2025-01-05T18:37:04.000Z","size":1873,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-07T12:39:42.608Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/NotTheStallion.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-02T09:38:30.000Z","updated_at":"2025-01-05T18:37:07.000Z","dependencies_parsed_at":null,"dependency_job_id":"f23555df-d958-4084-a529-434825ffdb6b","html_url":"https://github.com/NotTheStallion/RL_models","commit_stats":null,"previous_names":["notthestallion/rl_models"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotTheStallion%2FRL_models","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotTheStallion%2FRL_models/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotTheStallion%2FRL_models/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotTheStallion%2FRL_models/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/NotTheStallion","download_url":"https://codeload.github.com/NotTheStallion/RL_models/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246700506,"owners_count":20819886,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-14T12:15:39.160Z","updated_at":"2025-04-01T19:32:48.014Z","avatar_url":"https://github.com/NotTheStallion.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# RL_models\n\nThis repository contains implementations of three reinforcement learning (RL) algorithms: Monte Carlo, SARSA, and Q-learning. These algorithms were tested on a grid world environment with two different approaches:\n\n1. **Neutral Position Without Penalty**: In this approach, the neutral positions in the grid world do not penalize the agent. The agent is free to explore the grid without any negative reinforcement from neutral positions.\n\n2. **Neutral Position With Penalty**: In this approach, the neutral positions impose a penalty of -1. This penalty encourages the agent to find the shortest path to the goal, as lingering in neutral positions results in a cumulative negative reward.\n\nThe goal of these experiments is to compare the performance and behavior of the RL algorithms under different conditions and to observe how the penalty influences the agent's pathfinding strategy.\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnotthestallion%2Frl_models","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnotthestallion%2Frl_models","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnotthestallion%2Frl_models/lists"}