https://github.com/wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
https://github.com/wwxFromTju/awesome-reinforcement-learning-lib
List: awesome-reinforcement-learning-lib
reinforcement-learning reinforcement-learning-algorithms reinforcementlearning
Last synced: 6 months ago
JSON representation
GitHub's code repository is all you need
- Host: GitHub
- URL: https://github.com/wwxFromTju/awesome-reinforcement-learning-lib
- Owner: wwxFromTju
- Created: 2022-04-12T09:25:52.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2023-03-21T04:40:01.000Z (about 2 years ago)
- Last Synced: 2024-05-21T13:31:38.168Z (about 1 year ago)
- Topics: reinforcement-learning, reinforcement-learning-algorithms, reinforcementlearning
- Homepage:
- Size: 132 KB
- Stars: 294
- Watchers: 2
- Forks: 36
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- ultimate-awesome - awesome-reinforcement-learning-lib - GitHub's code repository is all you need. (Other Lists / Julia Lists)
README
# Awesome Reinforcement Learning Library
[RL Library](#rl-library)
[RL Accelerated Environment](#rl-accelerated-environment)
## RL Library
| Star | Last Update | Name | Backend |
| ------------------------------------------------------------ | ------------------------------------------------------------ | ------------------------------------------------------------ | ----------------------------- |
| [](https://github.com/ray-project/ray/stargazers) |  | [ray-rllib](https://github.com/ray-project/ray/) | pytorch, tensorflow-2.x |
| [](https://github.com/openai/baselines/stargazers) |  | [baselines](https://github.com/openai/baselines/) | tesorflow-1.x |
| [](https://github.com/google/dopamine/stargazers) |  | [dopamine](https://github.com/google/dopamine/) | tensorflow-2.x, tesorflow-1.x |
| [](https://github.com/openai/spinningup/stargazers) |  | [spinningup](https://github.com/openai/spinningup/) | pytorch, tesorflow-1.x |
| [](https://github.com/tensorlayer/TensorLayer/stargazers) |  | [TensorLayer](https://github.com/tensorlayer/TensorLayer/) | tensorflow-2.x |
| [](https://github.com/thu-ml/tianshou/stargazers) |  | [tianshou](https://github.com/thu-ml/tianshou/) | pytorch |
| [](https://github.com/keras-rl/keras-rl/stargazers) |  | [keras-rl](https://github.com/keras-rl/keras-rl/) | keras |
| [](https://github.com/DLR-RM/stable-baselines3/stargazers) |  | [stable-baselines3](https://github.com/DLR-RM/stable-baselines3/) | pytorch |
| [](https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/stargazers) |  | [Deep-Reinforcement-Learning-Algorithms-with-PyTorch](https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/) | pytorch |
| [](https://github.com/deepmind/open_spiel/stargazers) |  | [open_spiel](https://github.com/deepmind/open_spiel/) | pytorch, tensorflow-2.x |
| [](https://github.com/facebookresearch/ReAgent/stargazers) |  | [ReAgent](https://github.com/facebookresearch/ReAgent/) | pytorch |
| [](https://github.com/kwai/DouZero/stargazers) |  | [DouZero](https://github.com/kwai/DouZero/) | pytorch |
| [](https://github.com/tensorforce/tensorforce/stargazers) |  | [tensorforce](https://github.com/tensorforce/tensorforce/) | tensorflow-2.x |
| [](https://github.com/deepmind/acme/stargazers) |  | [acme](https://github.com/deepmind/acme/) | jax, tensorflow-2.x |
| [](https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/stargazers) |  | [pytorch-a2c-ppo-acktr-gail](https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail/) | pytorch |
| [](https://github.com/deepmind/trfl/stargazers) |  | [trfl](https://github.com/deepmind/trfl/) | tensorflow-2.x, tesorflow-1.x |
| [](https://github.com/PaddlePaddle/PARL/stargazers) |  | [PARL](https://github.com/PaddlePaddle/PARL/) | paddle, pytorch |
| [](https://github.com/AI4Finance-Foundation/ElegantRL/stargazers) |  | [ElegantRL](https://github.com/AI4Finance-Foundation/ElegantRL/) | pytorch |
| [](https://github.com/tensorflow/agents/stargazers) |  | [agents](https://github.com/tensorflow/agents/) | tensorflow-2.x, tesorflow-1.x |
| [](https://github.com/opendilab/DI-engine/stargazers) |  | [DI-engine](https://github.com/opendilab/DI-engine/) | pytorch |
| [](https://github.com/vwxyzjn/cleanrl/stargazers) |  | [cleanrl](https://github.com/vwxyzjn/cleanrl/) | pytorch |
| [](https://github.com/IntelLabs/coach/stargazers) |  | [coach](https://github.com/IntelLabs/coach/) | tesorflow-1.x |
| [](https://github.com/datamllab/rlcard/stargazers) |  | [rlcard](https://github.com/datamllab/rlcard/) | pytorch |
| [](https://github.com/rail-berkeley/rlkit/stargazers) |  | [rlkit](https://github.com/rail-berkeley/rlkit/) | pytorch |
| [](https://github.com/astooke/rlpyt/stargazers) |  | [rlpyt](https://github.com/astooke/rlpyt/) | pytorch |
| [](https://github.com/rlworkgroup/garage/stargazers) |  | [garage](https://github.com/rlworkgroup/garage/) | tensorflow-2.x |
| [](https://github.com/kengz/SLM-Lab/stargazers) |  | [SLM-Lab](https://github.com/kengz/SLM-Lab/) | pytorch |
| [](https://github.com/chainer/chainerrl/stargazers) |  | [chainerrl](https://github.com/chainer/chainerrl/) | chainer |
| [](https://github.com/facebookresearch/rl/stargazers) |  | [rl](https://github.com/facebookresearch/rl/) | pytorch |
| [](https://github.com/pfnet/pfrl/stargazers) |  | [pfrl](https://github.com/pfnet/pfrl/) | pytorch |
| [](https://github.com/deepmind/rlax/stargazers) |  | [rlax](https://github.com/deepmind/rlax/) | jax |
| [](https://github.com/google-research/batch-ppo/stargazers) |  | [batch-ppo](https://github.com/google-research/batch-ppo/) | tesorflow-1.x |
| [](https://github.com/deepmind/scalable_agent/stargazers) |  | [scalable_agent](https://github.com/deepmind/scalable_agent/) | tesorflow-1.x |
| [](https://github.com/takuseno/d3rlpy/stargazers) |  | [d3rlpy](https://github.com/takuseno/d3rlpy/) | pytorch |
| [](https://github.com/google-research/seed_rl/stargazers) |  | [seed_rl](https://github.com/google-research/seed_rl/) | tensorflow-2.x |
| [](https://github.com/facebookresearch/mbrl-lib/stargazers) |  | [mbrl-lib](https://github.com/facebookresearch/mbrl-lib/) | pytorch |
| [](https://github.com/facebookresearch/torchbeast/stargazers) |  | [torchbeast](https://github.com/facebookresearch/torchbeast/) | pytorch |
| [](https://github.com/MushroomRL/mushroom-rl/stargazers) |  | [mushroom-rl](https://github.com/MushroomRL/mushroom-rl/) | pytorch |
| [](https://github.com/deepmind/reverb/stargazers) |  | [reverb](https://github.com/deepmind/reverb/) | jax, tensorflow-2.x |
| [](https://github.com/NVlabs/GA3C/stargazers) |  | [GA3C](https://github.com/NVlabs/GA3C/) | tesorflow-1.x |
| [](https://github.com/cpnota/autonomous-learning-library/stargazers) |  | [autonomous-learning-library](https://github.com/cpnota/autonomous-learning-library/) | pytorch |
| [](https://github.com/tinkoff-ai/CORL/stargazers) |  | [CORL](https://github.com/tinkoff-ai/CORL/) | pytorch |
| [](https://github.com/alex-petrenko/sample-factory/stargazers) |  | [sample-factory](https://github.com/alex-petrenko/sample-factory/) | pytorch |
| [](https://github.com/lcswillems/rl-starter-files/stargazers) |  | [rl-starter-files](https://github.com/lcswillems/rl-starter-files/) | pytorch |
| [](https://github.com/VinF/deer/stargazers) |  | [deer](https://github.com/VinF/deer/) | tensorflow-2.x |
| [](https://github.com/SurrealAI/surreal/stargazers) |  | [surreal](https://github.com/SurrealAI/surreal/) | pytorch |
| [](https://github.com/medipixel/rl_algorithms/stargazers) |  | [rl_algorithms](https://github.com/medipixel/rl_algorithms/) | pytorch |
| [](https://github.com/dongminlee94/deep_rl/stargazers) |  | [deep_rl](https://github.com/dongminlee94/deep_rl/) | pytorch |
| [](https://github.com/ikostrikov/jaxrl/stargazers) |  | [jaxrl](https://github.com/ikostrikov/jaxrl/) | jax |
| [](https://github.com/Rafael1s/Deep-Reinforcement-Learning-Algorithms/stargazers) |  | [Deep-Reinforcement-Learning-Algorithms](https://github.com/Rafael1s/Deep-Reinforcement-Learning-Algorithms/) | pytorch |
| [](https://github.com/eleurent/rl-agents/stargazers) |  | [rl-agents](https://github.com/eleurent/rl-agents/) | pytorch |
| [](https://github.com/google-research/batch_rl/stargazers) |  | [batch_rl](https://github.com/google-research/batch_rl/) | tensorflow-2.x |
| [](https://github.com/StepNeverStop/RLs/stargazers) |  | [RLs](https://github.com/StepNeverStop/RLs/) | pytorch |
| [](https://github.com/facebookresearch/salina/stargazers) |  | [salina](https://github.com/facebookresearch/salina/) | pytorch |
| [](https://github.com/Denys88/rl_games/stargazers) |  | [rl_games](https://github.com/Denys88/rl_games/) | pytorch |
| [](https://github.com/edbeeching/godot_rl_agents/stargazers) |  | [godot_rl_agents](https://github.com/edbeeching/godot_rl_agents/) | pytorch |
| [](https://github.com/SforAiDl/genrl/stargazers) |  | [genrl](https://github.com/SforAiDl/genrl/) | pytorch |
| [](https://github.com/fabiopardo/tonic/stargazers) |  | [tonic](https://github.com/fabiopardo/tonic/) | pytorch, tensorflow-2.x |
| [](https://github.com/zuoxingdong/lagom/stargazers) |  | [lagom](https://github.com/zuoxingdong/lagom/) | pytorch |
| [](https://github.com/sjtu-marl/malib/stargazers) |  | [malib](https://github.com/sjtu-marl/malib/) | pytorch |
| [](https://github.com/iffiX/machin/stargazers) |  | [machin](https://github.com/iffiX/machin/) | pytorch |
| [](https://github.com/kakaoenterprise/JORLDY/stargazers) |  | [JORLDY](https://github.com/kakaoenterprise/JORLDY/) | pytorch |
| [](https://github.com/rlgraph/rlgraph/stargazers) |  | [rlgraph](https://github.com/rlgraph/rlgraph/) | pytorch, tesorflow-1.x |
| [](https://github.com/facebookresearch/rlmeta/stargazers) |  | [rlmeta](https://github.com/facebookresearch/rlmeta/) | pytorch |
| [](https://github.com/rll-research/url_benchmark/stargazers) |  | [url_benchmark](https://github.com/rll-research/url_benchmark/) | pytorch |
| [](https://github.com/uoe-agents/epymarl/stargazers) |  | [epymarl](https://github.com/uoe-agents/epymarl/) | pytorch |
| [](https://github.com/huawei-noah/xingtian/stargazers) |  | [xingtian](https://github.com/huawei-noah/xingtian/) | tesorflow-1.x |
| [](https://github.com/DeNA/HandyRL/stargazers) |  | [HandyRL](https://github.com/DeNA/HandyRL/) | pytorch |
| [](https://github.com/facebookresearch/rlstructures/stargazers) |  | [rlstructures](https://github.com/facebookresearch/rlstructures/) | pytorch |
| [](https://github.com/RITCHIEHuang/DeepRL_Algorithms/stargazers) |  | [DeepRL_Algorithms](https://github.com/RITCHIEHuang/DeepRL_Algorithms/) | pytorch, tensorflow-2.x |
| [](https://github.com/infer-actively/pymdp/stargazers) |  | [pymdp](https://github.com/infer-actively/pymdp/) | numpy |
| [](https://github.com/Stable-Baselines-Team/stable-baselines/stargazers) |  | [stable-baselines](https://github.com/Stable-Baselines-Team/stable-baselines/) | tesorflow-1.x |
| [](https://github.com/david-abel/simple_rl/stargazers) |  | [simple_rl](https://github.com/david-abel/simple_rl/) | numpy |
| [](https://github.com/HorizonRobotics/alf/stargazers) |  | [alf](https://github.com/HorizonRobotics/alf/) | pytorch, Tensorflow 2.1 |
| [](https://github.com/trackmania-rl/tmrl/stargazers) |  | [tmrl](https://github.com/trackmania-rl/tmrl/) | pytorch |
| [](https://github.com/Alfredvc/paac/stargazers) |  | [paac](https://github.com/Alfredvc/paac/) | tesorflow-1.x |
| [](https://github.com/heronsystems/adeptRL/stargazers) |  | [adeptRL](https://github.com/heronsystems/adeptRL/) | pytorch |
| [](https://github.com/twni2016/pomdp-baselines/stargazers) |  | [pomdp-baselines](https://github.com/twni2016/pomdp-baselines/) | pytorch |
| [](https://github.com/Toni-SM/skrl/stargazers) |  | [skrl](https://github.com/Toni-SM/skrl/) | pytorch |
| [](https://github.com/uber-research/ape-x/stargazers) |  | [ape-x](https://github.com/uber-research/ape-x/) | tesorflow-1.x |
| [](https://github.com/facebookresearch/mtrl/stargazers) |  | [mtrl](https://github.com/facebookresearch/mtrl/) | pytorch |
| [](https://github.com/alibaba/EasyReinforcementLearning/stargazers) |  | [EasyReinforcementLearning](https://github.com/alibaba/EasyReinforcementLearning/) | tesorflow-1.x |
| [](https://github.com/RchalYang/torchrl/stargazers) |  | [torchrl](https://github.com/RchalYang/torchrl/) | pytorch |
| [](https://github.com/inspirai/TimeChamber/stargazers) |  | [TimeChamber](https://github.com/inspirai/TimeChamber/) | pytorch |
| [](https://github.com/google-research/rlds/stargazers) |  | [rlds](https://github.com/google-research/rlds/) | tensorflow-2.x |
| [](https://github.com/coax-dev/coax/stargazers) |  | [coax](https://github.com/coax-dev/coax/) | jax |
| [](https://github.com/tencent-ailab/tleague_projpage/stargazers) |  | [tleague_projpage](https://github.com/tencent-ailab/tleague_projpage/) | tesorflow-1.x |
| [](https://github.com/rlberry-py/rlberry/stargazers) |  | [rlberry](https://github.com/rlberry-py/rlberry/) | jax, pytorch |
| [](https://github.com/Ericonaldo/ILSwiss/stargazers) |  | [ILSwiss](https://github.com/Ericonaldo/ILSwiss/) | pytorch |
| [](https://github.com/google/deluca/stargazers) |  | [deluca](https://github.com/google/deluca/) | jax |
| [](https://github.com/sony/nnabla-rl/stargazers) |  | [nnabla-rl](https://github.com/sony/nnabla-rl/) | nnabla |
| [](https://github.com/schatty/d4pg-pytorch/stargazers) |  | [d4pg-pytorch](https://github.com/schatty/d4pg-pytorch/) | pytorch |
| [](https://github.com/ethanluoyc/magi/stargazers) |  | [magi](https://github.com/ethanluoyc/magi/) | jax |
| [](https://github.com/spitis/mrl/stargazers) |  | [mrl](https://github.com/spitis/mrl/) | pytorch |
| [](https://github.com/leggedrobotics/rsl_rl/stargazers) |  | [rsl_rl](https://github.com/leggedrobotics/rsl_rl/) | pytorch |
| [](https://github.com/cyoon1729/distributedRL/stargazers) |  | [distributedRL](https://github.com/cyoon1729/distributedRL/) | pytorch |
| [](https://github.com/araffin/sbx/stargazers) |  | [sbx](https://github.com/araffin/sbx/) | jax |
| [](https://github.com/facebookresearch/rela/stargazers) |  | [rela](https://github.com/facebookresearch/rela/) | pytorch |
| [](https://github.com/chandar-lab/RLHive/stargazers) |  | [RLHive](https://github.com/chandar-lab/RLHive/) | torch |
| [](https://github.com/google-research/deep_ope/stargazers) |  | [deep_ope](https://github.com/google-research/deep_ope/) | tensorflow-2.x |
| [](https://github.com/ku2482/rljax/stargazers) |  | [rljax](https://github.com/ku2482/rljax/) | jax |
| [](https://github.com/qlan3/Explorer/stargazers) |  | [Explorer](https://github.com/qlan3/Explorer/) | pytorch |
| [](https://github.com/x35f/unstable_baselines/stargazers) |  | [unstable_baselines](https://github.com/x35f/unstable_baselines/) | tensorflow-2.x |
| [](https://github.com/henry-prior/jax-rl/stargazers) |  | [jax-rl](https://github.com/henry-prior/jax-rl/) | jax |
| [](https://github.com/horoiwa/deep_reinforcement_learning_gallery/stargazers) |  | [deep_reinforcement_learning_gallery](https://github.com/horoiwa/deep_reinforcement_learning_gallery/) | tensorflow-2.x |
| [](https://github.com/ymd-h/cpprb/stargazers) |  | [cpprb](https://github.com/ymd-h/cpprb/) | |
| [](https://github.com/google/simple-reinforcement-learning/stargazers) |  | [simple-reinforcement-learning](https://github.com/google/simple-reinforcement-learning/) | tesorflow-1.x |
| [](https://github.com/hari-sikchi/safeRL/stargazers) |  | [safeRL](https://github.com/hari-sikchi/safeRL/) | pytorch |
| [](https://github.com/stepjam/YARR/stargazers) |  | [YARR](https://github.com/stepjam/YARR/) | pytorch |
| [](https://github.com/clvoloshin/COBS/stargazers) |  | [COBS](https://github.com/clvoloshin/COBS/) | pytorch, tensorflow-2.x |
| [](https://github.com/Shanghai-Digital-Brain-Laboratory/DB-Football/stargazers) |  | [DB-Football](https://github.com/Shanghai-Digital-Brain-Laboratory/DB-Football) | pytorch |
| [](https://github.com/0xangelo/raylab/stargazers) |  | [raylab](https://github.com/0xangelo/raylab/) | pytorch |
| [](https://github.com/instadeepai/fastpbrl/stargazers) |  | [fastpbrl](https://github.com/instadeepai/fastpbrl/) | jax, pytorch |
| [](https://github.com/harvard-edge/QuaRL/stargazers) |  | [QuaRL](https://github.com/harvard-edge/QuaRL/) | tensorflow-2.x |
| [](https://github.com/astooke/accel_rl/stargazers) |  | [accel_rl](https://github.com/astooke/accel_rl/) | theano |
| [](https://github.com/osudrl/apex/stargazers) |  | [apex](https://github.com/osudrl/apex/) | pytorch |
| [](https://github.com/danijar/embodied/stargazers) |  | [embodied](https://github.com/danijar/embodied/) | tensorflow |
| [](https://github.com/kngwyu/Rainy/stargazers) |  | [Rainy](https://github.com/kngwyu/Rainy/) | pytorch |
| [](https://github.com/lns/dapo/stargazers) |  | [dapo](https://github.com/lns/dapo/) | tesorflow-1.x |
| [](https://github.com/sdpkjc/abcdrl/stargazers) |  | [abcdrl](https://github.com/sdpkjc/abcdrl/) | pytorch |
| [](https://github.com/RobertTLange/gymnax-blines/stargazers) |  | [gymnax-blines](https://github.com/RobertTLange/gymnax-blines/) | jax |
| [](https://github.com/quantumiracle/MARS/stargazers) |  | [MARS](https://github.com/quantumiracle/MARS/) | pytorch |
| [](https://github.com/indylab/nxdo/stargazers) |  | [nxdo](https://github.com/indylab/nxdo/) | pytorch |
| [](https://github.com/facebookresearch/gala/stargazers) |  | [gala](https://github.com/facebookresearch/gala/) | tesorflow-1.x |
| [](https://github.com/RedTachyon/coltra-rl/stargazers) |  | [coltra-rl](https://github.com/RedTachyon/coltra-rl/) | pytorch |
| [](https://github.com/IouJenLiu/HTS-RL/stargazers) |  | [HTS-RL](https://github.com/IouJenLiu/HTS-RL/) | pytorch |
| [](https://github.com/lns/memoire/stargazers) |  | [memoire](https://github.com/lns/memoire/) | |
| [](https://github.com/perrin-isir/xpag/stargazers) |  | [xpag](https://github.com/perrin-isir/xpag/) | jax |
| [](https://github.com/semitable/fast-marl/stargazers) |  | [fast-marl](https://github.com/semitable/fast-marl/) | pytorch |
| [](https://github.com/TinkTheBoush/haiku-baseline/stargazers) |  | [haiku-baseline](https://github.com/TinkTheBoush/haiku-baseline/) | jax |
| [](https://github.com/mindspore-ai/reinforcement/stargazers) |  | [reinforcement](https://github.com/mindspore-ai/reinforcement/) | mindspore |
| [](https://github.com/jsw7460/sb3_jax/stargazers) |  | [sb3_jax](https://github.com/jsw7460/sb3_jax/) | jax |
| [](https://github.com/exalearn/exarl/stargazers) |  | [exarl](https://github.com/exalearn/exarl/) | tf-2.x |
| [](https://github.com/m-wojnar/reinforced-lib/stargazers) |  | [reinforced-lib](https://github.com/m-wojnar/reinforced-lib/) | jax |
| [](https://github.com/yannickycpan/reproduceRL/stargazers) |  | [reproduceRL](https://github.com/yannickycpan/reproduceRL/) | tensorflow-1.x |
| [](https://github.com/cpuheater/cause-life-is-a-game/stargazers) |  | [cause-life-is-a-game](https://github.com/cpuheater/cause-life-is-a-game/) | pytorch |
| [](https://github.com/kchua/mbrl-jax/stargazers) |  | [mbrl-jax](https://github.com/kchua/mbrl-jax/) | jax |
| [](https://github.com/tinyzqh/XuanJing/stargazers) |  | [XuanJing](https://github.com/tinyzqh/XuanJing/) | pytorch |
| [](https://github.com/FrankTianTT/causal-mbrl/stargazers) |  | [causal-mbrl](https://github.com/FrankTianTT/causal-mbrl/) | pytorch |## RL Accelerated Environment
| Star | arXiv | Last Update | Name | Accelerate Type | Property |
| ---------------------------------------------------------- | ------ | ------------------------------------------------------------ | ------------------------------------------------------------ | ------------------------------------------------------------ | ------------------------------------- |
| / | / | / | vec_env | subproc [[1]](https://github.com/openai/baselines/tree/master/baselines/common/vec_env) [[2]](https://github.com/openai/baselines/blob/master/baselines/common/vec_env/subproc_vec_env.py) | all |
| [](https://github.com/sail-sg/envpool/stargazers) | [](https://arxiv.org/abs/2206.10558) |  | [EnvPool](https://github.com/sail-sg/envpool/) | cpp | Atari, Mujoco, Compilable environment |
| [](https://github.com/facebookresearch/ELF/stargazers) | [](https://arxiv.org/abs/1707.01067) |  | [ELF](https://github.com/facebookresearch/ELF/) | cpp | Game in cpp, MiniRTS |
| [](https://github.com/NVlabs/cule/stargazers) | [](https://arxiv.org/abs/1907.08467) |  | [Cule](https://github.com/NVlabs/cule/) | gpu | Atari |
| [](https://github.com/google/brax/stargazers) | [](https://arxiv.org/abs/2106.13281) |  | [Brax](https://github.com/google/brax/) | gpu | robot |
| [](https://github.com/NVIDIA-Omniverse/IsaacGymEnvs/stargazers) | [](https://arxiv.org/abs/2108.10470) |  | [Isaac-gym](https://github.com/NVIDIA-Omniverse/IsaacGymEnvs/) | gpu | robot |
| [](https://github.com/salesforce/warp-drive/stargazers) | [](https://arxiv.org/abs/2108.13976) |  | [WarpDrive](https://github.com/salesforce/warp-drive/) | gpu | multiagent |
| [](https://github.com/Bam4d/Griddly/stargazers) | / |  | [griddly](https://github.com/Bam4d/Griddly/) | cpp | grid-world game |
| [](https://github.com/kvfrans/powderworld/stargazers) | / |  | [powderworld](https://github.com/kvfrans/powderworld/) | gpu | physics lightweight simulation environment |
| [](https://github.com/instadeepai/jumanji/stargazers) | / |  | [jumanji](https://github.com/instadeepai/jumanji/) | jit+xla | Game / Combinatorial |