{"id":13528431,"url":"https://github.com/kengz/SLM-Lab","last_synced_at":"2025-04-01T13:32:55.053Z","repository":{"id":40336195,"uuid":"105591065","full_name":"kengz/SLM-Lab","owner":"kengz","description":"Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book \"Foundations of Deep Reinforcement Learning\".","archived":false,"fork":false,"pushed_at":"2025-02-16T01:19:34.000Z","size":4279,"stargazers_count":1276,"open_issues_count":16,"forks_count":274,"subscribers_count":46,"default_branch":"master","last_synced_at":"2025-03-31T17:06:49.561Z","etag":null,"topics":["a2c","a3c","benchmark","deep-reinforcement-learning","dqn","policy-gradient","ppo","pytorch","reinforcement-learning","sac"],"latest_commit_sha":null,"homepage":"https://slm-lab.gitbook.io/slm-lab/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kengz.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-10-02T22:20:22.000Z","updated_at":"2025-03-26T15:21:50.000Z","dependencies_parsed_at":"2025-03-31T17:06:42.514Z","dependency_job_id":"7753e630-dfb8-48a8-bc63-5c91826079e1","html_url":"https://github.com/kengz/SLM-Lab","commit_stats":null,"previous_names":[],"tags_count":27,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kengz%2FSLM-Lab","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kengz%2FSLM-Lab/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kengz%2FSLM-Lab/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kengz%2FSLM-Lab/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kengz","download_url":"https://codeload.github.com/kengz/SLM-Lab/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246647756,"owners_count":20811379,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["a2c","a3c","benchmark","deep-reinforcement-learning","dqn","policy-gradient","ppo","pytorch","reinforcement-learning","sac"],"created_at":"2024-08-01T07:00:19.089Z","updated_at":"2025-04-01T13:32:55.026Z","avatar_url":"https://github.com/kengz.png","language":"Python","readme":"# SLM Lab \u003cbr\u003e ![GitHub tag (latest SemVer)](https://img.shields.io/github/tag/kengz/slm-lab) ![CI](https://github.com/kengz/SLM-Lab/workflows/CI/badge.svg) [![Maintainability](https://api.codeclimate.com/v1/badges/20c6a124c468b4d3e967/maintainability)](https://codeclimate.com/github/kengz/SLM-Lab/maintainability) [![Test Coverage](https://api.codeclimate.com/v1/badges/20c6a124c468b4d3e967/test_coverage)](https://codeclimate.com/github/kengz/SLM-Lab/test_coverage)\n\n\n\u003cp align=\"center\"\u003e\n  \u003ci\u003eModular Deep Reinforcement Learning framework in PyTorch.\u003c/i\u003e\n  \u003cbr\u003e\u003cbr\u003e\n  \u003cb\u003eDocumentation:\u003c/b\u003e\u003cbr\u003e\n  \u003ca href=\"https://slm-lab.gitbook.io/slm-lab/\"\u003ehttps://slm-lab.gitbook.io/slm-lab/\u003c/a\u003e\n  \u003cbr\u003e\u003cbr\u003e\n\u003c/p\u003e\n\n\u003eNOTE: the `book` branch has been updated for issue fixes. For the original code in the book _Foundations of Deep Reinforcement Learning_, check out to git tag `v4.1.1`\n\n|||||\n|:---:|:---:|:---:|:---:|\n| ![ppo beamrider](https://user-images.githubusercontent.com/8209263/63994698-689ecf00-caaa-11e9-991f-0a5e9c2f5804.gif) | ![ppo breakout](https://user-images.githubusercontent.com/8209263/63994695-650b4800-caaa-11e9-9982-2462738caa45.gif) | ![ppo kungfumaster](https://user-images.githubusercontent.com/8209263/63994690-60469400-caaa-11e9-9093-b1cd38cee5ae.gif) | ![ppo mspacman](https://user-images.githubusercontent.com/8209263/63994685-5cb30d00-caaa-11e9-8f35-78e29a7d60f5.gif) |\n| BeamRider | Breakout | KungFuMaster | MsPacman |\n| ![ppo pong](https://user-images.githubusercontent.com/8209263/63994680-59b81c80-caaa-11e9-9253-ed98370351cd.gif) | ![ppo qbert](https://user-images.githubusercontent.com/8209263/63994672-54f36880-caaa-11e9-9757-7780725b53af.gif) | ![ppo seaquest](https://user-images.githubusercontent.com/8209263/63994665-4dcc5a80-caaa-11e9-80bf-c21db818115b.gif) | ![ppo spaceinvaders](https://user-images.githubusercontent.com/8209263/63994624-15c51780-caaa-11e9-9c9a-854d3ce9066d.gif) |\n| Pong | Qbert | Seaquest | Sp.Invaders |\n| ![sac ant](https://user-images.githubusercontent.com/8209263/63994867-ff6b8b80-caaa-11e9-971e-2fac1cddcbac.gif) | ![sac halfcheetah](https://user-images.githubusercontent.com/8209263/63994869-01354f00-caab-11e9-8e11-3893d2c2419d.gif) | ![sac hopper](https://user-images.githubusercontent.com/8209263/63994871-0397a900-caab-11e9-9566-4ca23c54b2d4.gif) | ![sac humanoid](https://user-images.githubusercontent.com/8209263/63994883-0befe400-caab-11e9-9bcc-c30c885aad73.gif) |\n| Ant | HalfCheetah | Hopper | Humanoid |\n| ![sac doublependulum](https://user-images.githubusercontent.com/8209263/63994879-07c3c680-caab-11e9-974c-06cdd25bfd68.gif) | ![sac pendulum](https://user-images.githubusercontent.com/8209263/63994880-085c5d00-caab-11e9-850d-049401540e3b.gif) | ![sac reacher](https://user-images.githubusercontent.com/8209263/63994881-098d8a00-caab-11e9-8e19-a3b32d601b10.gif) | ![sac walker](https://user-images.githubusercontent.com/8209263/63994882-0abeb700-caab-11e9-9e19-b59dc5c43393.gif) |\n| Inv.DoublePendulum | InvertedPendulum | Reacher | Walker |\n\n","funding_links":[],"categories":["Libraries","Pytorch \u0026 related libraries｜Pytorch \u0026 相关库","Pytorch \u0026 related libraries","Python","漏洞库_漏洞靶场","Open Source Reinforcement Learning Platforms"],"sub_categories":["Other libraries｜其他库:","Other libraries:","General-Purpose Machine Learning","资源传输下载","Human Computer Interaction"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkengz%2FSLM-Lab","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkengz%2FSLM-Lab","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkengz%2FSLM-Lab/lists"}