awesome-reinforcement-learning-lib

GitHub's code repository is all you need
https://github.com/wwxFromTju/awesome-reinforcement-learning-lib

| Library | Framework |
| --- | --- |
| [ray-rllib](https://github.com/ray-project/ray/) | pytorch, tensorflow-2.x |
| [baselines](https://github.com/openai/baselines/) | tensorflow-1.x |
| [dopamine](https://github.com/google/dopamine/) | tensorflow-2.x, tensorflow-1.x |
| [spinningup](https://github.com/openai/spinningup/) | pytorch, tensorflow-1.x |
| [TensorLayer](https://github.com/tensorlayer/TensorLayer/) | tensorflow-2.x |
| [tianshou](https://github.com/thu-ml/tianshou/) | pytorch |
| [keras-rl](https://github.com/keras-rl/keras-rl/) | keras |
| [stable-baselines3](https://github.com/DLR-RM/stable-baselines3/) | pytorch |
| [Deep-Reinforcement-Learning-Algorithms-with-PyTorch](https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/) | pytorch |
| [open_spiel](https://github.com/deepmind/open_spiel/) | pytorch, tensorflow-2.x |
| [ReAgent](https://github.com/facebookresearch/ReAgent/) | pytorch |
| [DouZero](https://github.com/kwai/DouZero/) | pytorch |
| [tensorforce](https://github.com/tensorforce/tensorforce/) | tensorflow-2.x |
| [acme](https://github.com/deepmind/acme/) | jax, tensorflow-2.x |
| [pytorch-a2c-ppo-acktr-gail](https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/) | pytorch |
| [trfl](https://github.com/deepmind/trfl/) | tensorflow-2.x, tensorflow-1.x |
| [PARL](https://github.com/PaddlePaddle/PARL/) | paddle, pytorch |
| [ElegantRL](https://github.com/AI4Finance-Foundation/ElegantRL/) | pytorch |
| [agents](https://github.com/tensorflow/agents/) | tensorflow-2.x, tensorflow-1.x |
| [DI-engine](https://github.com/opendilab/DI-engine/) | pytorch |
| [cleanrl](https://github.com/vwxyzjn/cleanrl/) | pytorch |
| [coach](https://github.com/IntelLabs/coach/) | tensorflow-1.x |
| [rlcard](https://github.com/datamllab/rlcard/) | pytorch |
| [rlkit](https://github.com/rail-berkeley/rlkit/) | pytorch |
| [rlpyt](https://github.com/astooke/rlpyt/) | pytorch |
| [garage](https://github.com/rlworkgroup/garage/) | tensorflow-2.x |
| [SLM-Lab](https://github.com/kengz/SLM-Lab/) | pytorch |
| [chainerrl](https://github.com/chainer/chainerrl/) | chainer |
| [rl](https://github.com/facebookresearch/rl/) | pytorch |
| [pfrl](https://github.com/pfnet/pfrl/) | pytorch |
| [rlax](https://github.com/deepmind/rlax/) | jax |
| [batch-ppo](https://github.com/google-research/batch-ppo/) | tensorflow-1.x |
| [scalable_agent](https://github.com/deepmind/scalable_agent/) | tensorflow-1.x |
| [d3rlpy](https://github.com/takuseno/d3rlpy/) | pytorch |
| [seed_rl](https://github.com/google-research/seed_rl/) | tensorflow-2.x |
| [mbrl-lib](https://github.com/facebookresearch/mbrl-lib/) | pytorch |
| [torchbeast](https://github.com/facebookresearch/torchbeast/) | pytorch |
| [mushroom-rl](https://github.com/MushroomRL/mushroom-rl/) | pytorch |
| [reverb](https://github.com/deepmind/reverb/) | jax, tensorflow-2.x |
| [GA3C](https://github.com/NVlabs/GA3C/) | tensorflow-1.x |
| [autonomous-learning-library](https://github.com/cpnota/autonomous-learning-library/) | pytorch |
| [CORL](https://github.com/tinkoff-ai/CORL/) | pytorch |
| [sample-factory](https://github.com/alex-petrenko/sample-factory/) | pytorch |
| [rl-starter-files](https://github.com/lcswillems/rl-starter-files/) | pytorch |
| [deer](https://github.com/VinF/deer/) | tensorflow-2.x |
| [surreal](https://github.com/SurrealAI/surreal/) | pytorch |
| [rl_algorithms](https://github.com/medipixel/rl_algorithms/) | pytorch |
| [deep_rl](https://github.com/dongminlee94/deep_rl/) | pytorch |
| [jaxrl](https://github.com/ikostrikov/jaxrl/) | jax |
| [Deep-Reinforcement-Learning-Algorithms](https://github.com/Rafael1s/Deep-Reinforcement-Learning-Algorithms/) | pytorch |
| [rl-agents](https://github.com/eleurent/rl-agents/) | pytorch |
| [batch_rl](https://github.com/google-research/batch_rl/) | tensorflow-2.x |
| [RLs](https://github.com/StepNeverStop/RLs/) | pytorch |
| [salina](https://github.com/facebookresearch/salina/) | pytorch |
| [rl_games](https://github.com/Denys88/rl_games/) | pytorch |
| [godot_rl_agents](https://github.com/edbeeching/godot_rl_agents/) | pytorch |
| [genrl](https://github.com/SforAiDl/genrl/) | pytorch |
| [tonic](https://github.com/fabiopardo/tonic/) | pytorch, tensorflow-2.x |
| [lagom](https://github.com/zuoxingdong/lagom/) | pytorch |
| [malib](https://github.com/sjtu-marl/malib/) | pytorch |
| [machin](https://github.com/iffiX/machin/) | pytorch |
| [JORLDY](https://github.com/kakaoenterprise/JORLDY/) | pytorch |
| [rlgraph](https://github.com/rlgraph/rlgraph/) | pytorch, tensorflow-1.x |
| [rlmeta](https://github.com/facebookresearch/rlmeta/) | pytorch |
| [url_benchmark](https://github.com/rll-research/url_benchmark/) | pytorch |
| [epymarl](https://github.com/uoe-agents/epymarl/) | pytorch |
| [xingtian](https://github.com/huawei-noah/xingtian/) | tensorflow-1.x |
| [HandyRL](https://github.com/DeNA/HandyRL/) | pytorch |
| [rlstructures](https://github.com/facebookresearch/rlstructures/) | pytorch |
| [DeepRL_Algorithms](https://github.com/RITCHIEHuang/DeepRL_Algorithms/) | pytorch, tensorflow-2.x |
| [pymdp](https://github.com/infer-actively/pymdp/) | numpy |
| [stable-baselines](https://github.com/Stable-Baselines-Team/stable-baselines/) | tensorflow-1.x |
| [simple_rl](https://github.com/david-abel/simple_rl/) | numpy |
| [alf](https://github.com/HorizonRobotics/alf/) | pytorch, tensorflow-2.1 |
| [tmrl](https://github.com/trackmania-rl/tmrl/) | pytorch |
| [paac](https://github.com/Alfredvc/paac/) | tensorflow-1.x |
| [adeptRL](https://github.com/heronsystems/adeptRL/) | pytorch |
| [pomdp-baselines](https://github.com/twni2016/pomdp-baselines/) | pytorch |
| [skrl](https://github.com/Toni-SM/skrl/) | pytorch |
| [ape-x](https://github.com/uber-research/ape-x/) | tensorflow-1.x |
| [mtrl](https://github.com/facebookresearch/mtrl/) | pytorch |
| [EasyReinforcementLearning](https://github.com/alibaba/EasyReinforcementLearning/) | tensorflow-1.x |
| [torchrl](https://github.com/RchalYang/torchrl/) | pytorch |
| [TimeChamber](https://github.com/inspirai/TimeChamber/) | pytorch |
| [rlds](https://github.com/google-research/rlds/) | tensorflow-2.x |
| [coax](https://github.com/coax-dev/coax/) | jax |
| [tleague_projpage](https://github.com/tencent-ailab/tleague_projpage/) | tensorflow-1.x |
| [rlberry](https://github.com/rlberry-py/rlberry/) | jax, pytorch |
| [ILSwiss](https://github.com/Ericonaldo/ILSwiss/) | pytorch |
| [deluca](https://github.com/google/deluca/) | jax |
| [nnabla-rl](https://github.com/sony/nnabla-rl/) | nnabla |
| [d4pg-pytorch](https://github.com/schatty/d4pg-pytorch/) | pytorch |
| [magi](https://github.com/ethanluoyc/magi/) | jax |
| [mrl](https://github.com/spitis/mrl/) | pytorch |
| [rsl_rl](https://github.com/leggedrobotics/rsl_rl/) | pytorch |
| [distributedRL](https://github.com/cyoon1729/distributedRL/) | pytorch |
| [sbx](https://github.com/araffin/sbx/) | jax |
| [rela](https://github.com/facebookresearch/rela/) | pytorch |
| [RLHive](https://github.com/chandar-lab/RLHive/) | pytorch |
| [deep_ope](https://github.com/google-research/deep_ope/) | tensorflow-2.x |
| [rljax](https://github.com/ku2482/rljax/) | jax |
| [Explorer](https://github.com/qlan3/Explorer/) | pytorch |
| [unstable_baselines](https://github.com/x35f/unstable_baselines/) | tensorflow-2.x |
| [jax-rl](https://github.com/henry-prior/jax-rl/) | jax |
| [deep_reinforcement_learning_gallery](https://github.com/horoiwa/deep_reinforcement_learning_gallery/) | tensorflow-2.x |
| [cpprb](https://github.com/ymd-h/cpprb/) | |
| [simple-reinforcement-learning](https://github.com/google/simple-reinforcement-learning/) | tensorflow-1.x |
| [safeRL](https://github.com/hari-sikchi/safeRL/) | pytorch |
| [YARR](https://github.com/stepjam/YARR/) | pytorch |
| [COBS](https://github.com/clvoloshin/COBS/) | pytorch, tensorflow-2.x |
| [DB-Football](https://github.com/Shanghai-Digital-Brain-Laboratory/DB-Football) | pytorch |
| [raylab](https://github.com/0xangelo/raylab/) | pytorch |
| [fastpbrl](https://github.com/instadeepai/fastpbrl/) | jax, pytorch |
| [QuaRL](https://github.com/harvard-edge/QuaRL/) | tensorflow-2.x |
| [accel_rl](https://github.com/astooke/accel_rl/) | theano |
| [apex](https://github.com/osudrl/apex/) | pytorch |
| [embodied](https://github.com/danijar/embodied/) | tensorflow |
| [Rainy](https://github.com/kngwyu/Rainy/) | pytorch |
| [dapo](https://github.com/lns/dapo/) | tensorflow-1.x |
| [abcdrl](https://github.com/sdpkjc/abcdrl/) | pytorch |
| [gymnax-blines](https://github.com/RobertTLange/gymnax-blines/) | jax |
| [MARS](https://github.com/quantumiracle/MARS/) | pytorch |
| [nxdo](https://github.com/indylab/nxdo/) | pytorch |
| [gala](https://github.com/facebookresearch/gala/) | tensorflow-1.x |
| [coltra-rl](https://github.com/RedTachyon/coltra-rl/) | pytorch |
| [HTS-RL](https://github.com/IouJenLiu/HTS-RL/) | pytorch |
| [memoire](https://github.com/lns/memoire/) | |
| [xpag](https://github.com/perrin-isir/xpag/) | jax |
| [fast-marl](https://github.com/semitable/fast-marl/) | pytorch |
| [haiku-baseline](https://github.com/TinkTheBoush/haiku-baseline/) | jax |
| [reinforcement](https://github.com/mindspore-ai/reinforcement/) | mindspore |
| [sb3_jax](https://github.com/jsw7460/sb3_jax/) | jax |
| [exarl](https://github.com/exalearn/exarl/) | tensorflow-2.x |
| [reinforced-lib](https://github.com/m-wojnar/reinforced-lib/) | jax |
| [reproduceRL](https://github.com/yannickycpan/reproduceRL/) | tensorflow-1.x |
| [cause-life-is-a-game](https://github.com/cpuheater/cause-life-is-a-game/) | pytorch |
| [mbrl-jax](https://github.com/kchua/mbrl-jax/) | jax |
| [XuanJing](https://github.com/tinyzqh/XuanJing/) | pytorch |
| [causal-mbrl](https://github.com/FrankTianTT/causal-mbrl/) | pytorch |

| Library | Paper | Backend | Supported environments |
| --- | --- | --- | --- |
| [EnvPool](https://github.com/sail-sg/envpool/) | [arXiv:2206.10558](https://arxiv.org/abs/2206.10558) | cpp | Atari, MuJoCo, Compilable environment |
| [ELF](https://github.com/facebookresearch/ELF/) | [arXiv:1707.01067](https://arxiv.org/abs/1707.01067) | cpp | Game in cpp, MiniRTS |
| [Cule](https://github.com/NVlabs/cule/) | [arXiv:1907.08467](https://arxiv.org/abs/1907.08467) | gpu | Atari |
| [Brax](https://github.com/google/brax/) | [arXiv:2106.13281](https://arxiv.org/abs/2106.13281) | gpu | robot |
| [Isaac-gym](https://github.com/NVIDIA-Omniverse/IsaacGymEnvs/) | [arXiv:2108.10470](https://arxiv.org/abs/2108.10470) | gpu | robot |
| [WarpDrive](https://github.com/salesforce/warp-drive/) | [arXiv:2108.13976](https://arxiv.org/abs/2108.13976) | gpu | multi-agent |
| [griddly](https://github.com/Bam4d/Griddly/) | | cpp | grid-world game |
| [powderworld](https://github.com/kvfrans/powderworld/) | | gpu | lightweight physics simulation environment |
| [jumanji](https://github.com/instadeepai/jumanji/) | | jit+xla | Game / Combinatorial |