{"id":14109676,"url":"https://github.com/dvorjackz/MarioRL","last_synced_at":"2025-08-01T09:31:19.240Z","repository":{"id":37656491,"uuid":"266654885","full_name":"dvorjackz/MarioRL","owner":"dvorjackz","description":"Deep double Q-learning to help Mario beat Super Mario Bros on SNES","archived":false,"fork":false,"pushed_at":"2023-03-24T23:46:48.000Z","size":19418,"stargazers_count":5,"open_issues_count":6,"forks_count":0,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-08-14T10:15:23.465Z","etag":null,"topics":["double-q-","python","reinforcement-learning","tensorflow"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dvorjackz.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-05-25T01:19:23.000Z","updated_at":"2022-11-07T00:24:04.000Z","dependencies_parsed_at":"2024-01-08T08:00:09.064Z","dependency_job_id":null,"html_url":"https://github.com/dvorjackz/MarioRL","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dvorjackz%2FMarioRL","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dvorjackz%2FMarioRL/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dvorjackz%2FMarioRL/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dvorjackz%2FMarioRL/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dvorjackz","download_url":"https:
//codeload.github.com/dvorjackz/MarioRL/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":228360197,"owners_count":17907873,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["double-q-","python","reinforcement-learning","tensorflow"],"created_at":"2024-08-14T10:02:26.667Z","updated_at":"2024-12-05T19:30:42.801Z","avatar_url":"https://github.com/dvorjackz.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# MarioRL\n\nUsing [Gym Super Mario Bros](https://pypi.org/project/gym-super-mario-bros/) as the environment and [Stable Baselines](https://github.com/hill-a/stable-baselines), a fork of OpenAI's popular [Baselines](https://github.com/openai/baselines) reinforcement learning library, we apply concepts highlighted in recent influential papers in the RL space to train an agent to beat Super Mario Bros for the NES as quickly as possible.\n\nThe final report on our findings is included in the repo as paper.pdf.\n\n## Setup\n*Important*: this project requires a Python version \u003c 3.8, preferably Python 3.7.6.\n```\npython -m venv .env\nsource .env/bin/activate\npip install -r requirements.txt\n```\n\n## Potential PMIX Error Fix\nIf you encounter a PMIX error, set the following environment variable:\n```\nexport PMIX_MCA_gds=hash\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdvorjackz%2FMarioRL","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdvorjackz%2FMarioRL","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdvorjackz%2FMarioRL/lists"}