Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-deep-reinforcement-learning

Curated list for Deep Reinforcement Learning (DRL): software frameworks, models, datasets, gyms, baselines...
https://github.com/jgvictores/awesome-deep-reinforcement-learning

scikit-learn
scikit-image
microsoft/DirectML
safari
presentation - deep-reinforcement-learning/blob/143a885cc10b4331b9b3fa3e1a9436d5325676af/doc/inria2017DLFrameworks.pdf)).
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
pytorch/pytorch - commit/pytorch/pytorch?label=last%20update)
keras-team/keras - team/keras)](https://github.com/keras-team/keras/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/keras-team/keras?label=last%20update)
keras - learning-python), [2](https://elitedatascience.com/keras-tutorial-deep-learning-in-python)
safari
safari
tensorflow/tensorflow - commit/tensorflow/tensorflow?label=last%20update)
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
flashlight/flashlight - commit/flashlight/flashlight?label=last%20update)
https://github.com/janhuenermann/neurojs - commit/janhuenermann/neurojs?label=last%20update)
ONNX
OpenCV
Chainer
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
DALI - accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Sonnet
MXNet
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
Darknet
ml5
DL4J
oneapi-src/oneDNN
sony/nnabla
Torch
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
jittor
PaddlePaddle
CoreML - C) (support: Apple)
GitHub
GitHub
GitHub
GitHub
OpenNN
PyBrain
Caffe
MILA stopped developing
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
facebookresearch/Detectron
arxiv - fcis).
arxiv - freiburg.de/people/ronneber/u-net/).
arxiv
arxiv - Single-Shot-MultiBox-Detector)
arxiv
arxiv - brief-history-of-cnns-in-image-segmentation-from-r-cnn-to-mask-r-cnn-34ea83205de4)): Fast R-CNN, Faster R-CNN, Mask R-CNN.
1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
arxiv
arxiv
arxiv
arxiv
arxiv - 3 weeks.
arxiv - 7 million parameters, via smaller convs. A more aggressive cropping approach than that of Krizhevsky. Batch normalization, image distortions, RMSprop. Uses 9 novel "Inception modules" (at each layer of a traditional ConvNet, you have to make a choice of whether to have a pooling operation or a conv operation as well as the choice of filter size; an Inception module performa all these operations in parallel), and no fully connected. Trained on CPU (estimated as weeks via GPU) implemented in DistBelief (closed-source predecessor of TensorFlow). Variants ([summary](https://towardsdatascience.com/a-simple-guide-to-the-versions-of-the-inception-network-7fc52b863202)): v1, v2, v4, resnet v1, resnet v2; v9 ([slides](http://lsun.cs.princeton.edu/slides/Christian.pdf)). Also see [Xception (2017)](https://arxiv.org/pdf/1610.02357.pdf) paper.
arxiv
doi - justified finer tuning and visualization (namely Deconvolutional Network).
doi - 61 million parameters, split into 2 pipelines to enable 5-6 day GTX 580 GPU training (while CPU data augmentation).
doi
thunlp/GNNPapers
Geometric deep learning
chihming/awesome-network-embedding
DLG
pytorch
tensorflow/gnn
pytorch
ref
caffe
tensorflow - tutorial-fine-tuning-using-pre-trained-models/)
arxiv - painterly-harmonization)
arxiv - photo-styletransfer)
arxiv - style), keras [1](https://github.com/keras-team/keras/blob/master/examples/neural_style_transfer.py) [2](https://github.com/titu1994/Neural-Style-Transfer) [3](https://github.com/handong1587/handong1587.github.io/blob/master/_posts/deep_learning/2015-10-09-fun-with-deep-learning.md) [4](https://medium.com/mlreview/making-ai-art-with-style-transfer-using-keras-8bb5fa44b216)
hindupuravinash/the-gan-zoo
arxiv - pytorch)
arxiv
arxiv - Adversarial-Networks)
CycleGAN - Yan Zhu et Al; Berkeley; "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks". [torch](https://github.com/junyanz/CycleGAN) and migrated to [pytorch](https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix).
arxiv
arxiv
FTTNet - Time Speaker-Dependent Neural Vocoder". [pytorch](https://github.com/mozilla/FFTNet)
WaveNet
keras
word2vec
keras - image-similarity)
arxiv
wikipedia
awesomedata/awesome-public-datasets
MIT Places
MNIST - MNIST](https://github.com/rois-codh/kmnist).
ImageNet
PASCAL VOC
CIFAR-10
CIFAR-100
MIT MM Stimuli
SVHN
HICO
Visual Genome
COCO
Quick Draw (Google)
iCubWorld - camera-dataset](https://github.com/muratkrty/iCub-camera-dataset).
Kinetics (DeepMind)
HowTo100M
text8
UMICH SI650
wikipedia
DOI: 10.1145/3447526.3472059
brain-research/realistic-ssl-evaluation
keras web - team/keras/tree/master/keras/applications), [keras 2](https://github.com/keras-team/keras-applications), [pytorch](https://pytorch.org/docs/stable/torchvision/models.html), [caffe](https://github.com/BVLC/caffe/wiki/Model-Zoo), [ONNX](https://github.com/onnx/models) (pytorch/caffe2).
keras
keras - 10 weights](https://drive.google.com/open?id=0B4odNGNGJ56qVW9JdkthbzBsX28) / [keras CIFAR-100 weights](https://drive.google.com/open?id=0B4odNGNGJ56qTEdnT1RjTU44Zms)
keras by keras - team/keras/tree/e15533e6c725dca8c37a861aacb13ef149789433/keras/applications)) / [keras by kaggle](https://www.kaggle.com/keras) / [pytorch by kaggle](https://www.kaggle.com/pytorch)
keras
keras
caffe by original VGG author
gensim
keras
wikipedia - activations/), [ref](https://towardsdatascience.com/deep-study-of-a-not-very-deep-neural-network-part-2-activation-functions-fd9bd8d406fc).
keras
keras
facebookresearch/nevergrad
keras
wikipedia
keras
wikipedia - validation).
tensorflow - tutorial-fine-tuning-using-pre-trained-models/)
tensorboard
tensorboardX
keras - deep-learning-neural-network-model-keras/), [2](https://github.com/keplr-io/quiver), [3](https://raghakot.github.io/keras-vis/), [4](https://www.kaggle.com/amarjeet007/visualize-cnn-with-keras)
tensorflow online demo
loss-landscape
netscope
slundberg/shap
EthicalML/xai
Reinforcement Learning Specialization - 20). Note that another major separation is off/on policy RL algorithms. DRL methods would fit into function approximators.
Deep Reinforcement Learning CS 285 at UC Berkeley - fa20/), Lecture 4.
Part 2: Kinds of RL Algorithms - Rendered from <https://github.com/openai/spinningup/blob/038665d62d569055401d91856abb287263096178/docs/spinningup/rl_intro2.rst>
ray-project/ray - project/ray)](https://github.com/ray-project/ray/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/ray-project/ray?label=last%20update) (Ray total) (also covers multiagent)
Unity-Technologies/ml-agents
google/dopamine - commit/google/dopamine?label=last%20update)
keras-rl/keras-rl - rl/keras-rl)](https://github.com/keras-rl/keras-rl/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/keras-rl/keras-rl?label=last%20update)
SoyGema/Startcraft_pysc2_minigames
thu-ml/tianshou - ml/tianshou)](https://github.com/thu-ml/tianshou/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/thu-ml/tianshou?label=last%20update)
DLR-RM/stable-baselines3 - a/stable-baselines](https://github.com/hill-a/stable-baselines) fork of [openai/baselines](https://github.com/openai/baselines)) [![GitHub stars](https://img.shields.io/github/stars/DLR-RM/stable-baselines3)](https://github.com/DLR-RM/stable-baselines3/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/DLR-RM/stable-baselines3?label=last%20update)
https://github.com/janhuenermann/neurojs - commit/janhuenermann/neurojs?label=last%20update)
deepmind/open_spiel - commit/deepmind/open_spiel?label=last%20update)
reinforceio/tensorforce - commit/reinforceio/tensorforce?label=last%20update)
deepmind/trfl - commit/deepmind/trfl?label=last%20update)
catalyst-team/catalyst - team/catalyst)](https://github.com/catalyst-team/catalyst/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/catalyst-team/catalyst?label=last%20update)
deepmind/acme - commit/deepmind/acme?label=last%20update)
rll/rllab - commit/rll/rllab?label=last%20update)
tensorflow/agents - commit/tensorflow/agents?label=last%20update)
astooke/rlpyt - commit/astooke/rlpyt?label=last%20update)
rail-berkeley/rlkit - berkeley/rlkit)](https://github.com/rail-berkeley/rlkit/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/rail-berkeley/rlkit?label=last%20update)
vwxyzjn/cleanrl - commit/vwxyzjn/cleanrl?label=last%20update)
oxwhirl/pymarl - agent reinforcement learning [![GitHub stars](https://img.shields.io/github/stars/oxwhirl/pymarl)](https://github.com/oxwhirl/pymarl/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/oxwhirl/pymarl?label=last%20update)
deepmind/bsuite - commit/deepmind/bsuite?label=last%20update)
chainer/chainerrl - commit/chainer/chainerrl?label=last%20update)
facebookresearch/rl - commit/facebookresearch/rl?label=last%20update)
MushroomRL/mushroom-rl - rl)](https://github.com/MushroomRL/mushroom-rl/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/MushroomRL/mushroom-rl?label=last%20update)
SurrealAI/surreal - commit/SurrealAI/surreal?label=last%20update)
medipixel/rl_algorithms - commit/medipixel/rl_algorithms?label=last%20update)
ikostrikov/jaxrl2 - commit/ikostrikov/jaxrl2?label=last%20update)
ikostrikov/jaxrl - commit/ikostrikov/jaxrl?label=last%20update)
tinkoff-ai/CORL - quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC" [![GitHub stars](https://img.shields.io/github/stars/tinkoff-ai/CORL)](https://github.com/tinkoff-ai/CORL/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/tinkoff-ai/CORL?label=last%20update)
learnables/cherry - commit/learnables/cherry?label=last%20update)
trackmania-rl/tmrl - rl/tmrl)](https://github.com/trackmania-rl/tmrl/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/trackmania-rl/tmrl?label=last%20update)
ethanluoyc/magi - commit/ethanluoyc/magi?label=last%20update)
RL-Glue - glue-ext/wikis/RLGlueCore.wiki)) (API: C/C++, Java, Matlab, Python, Lisp) (support: Alberta)
tensorflow/tensorflow - commit/tensorflow/tensorflow?label=last%20update)
pytorch/pytorch - commit/pytorch/pytorch?label=last%20update)
keras-team/keras - team/keras)](https://github.com/keras-team/keras/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/keras-team/keras?label=last%20update)
google/jax - commit/google/jax?label=last%20update)
facebookresearch/mbrl-lib - lib)](https://github.com/facebookresearch/mbrl-lib/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/facebookresearch/mbrl-lib?label=last%20update)
haarnoja/sac
Asap7772/PTR - Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
ikostrikov/pytorch-a2c-ppo-acktr
openai/spinningup - commit/openai/spinningup?label=last%20update)
qfettes/DeepRL-Tutorials - Tutorials)](https://github.com/qfettes/DeepRL-Tutorials/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/qfettes/DeepRL-Tutorials?label=last%20update)
Farama-Foundation/Gymnasium - Foundation/Gymnasium)](https://github.com/Farama-Foundation/Gymnasium/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/Farama-Foundation/Gymnasium?label=last%20update). ~~DEPRECATED: [openai/gym](https://github.com/openai/gym), <https://gym.openai.com>, <https://gym.openai.com/docs/>~~
Farama-Foundation/Gymnasium-Robotics
Farama-Foundation/Minigrid
Farama-Foundation/MiniWorld
Farama-Foundation/ViZDoom
openai/roboschool
Unity-Technologies/ml-agents
Unity-Technologies/obstacle-tower-env
LucasAlegre/sumo-rl
qgallouedec/panda-gym
NVIDIA-Omniverse/IsaacGymEnvs
leggedrobotics/legged_gym
osudrl/cassie-mujoco-sim
utiasDSL/safe-control-gym
deepmind/bsuite
openai/gym-soccer
erlerobot/gym-gazebo
robotology/gym-ignition
dartsim/gym-dart
Roboy/gym-roboy
kngwyu/mujoco-maze
Improbable-AI/walk-these-ways
ucuapps/modelicagym
openai/safety-gym
openai/retro
deepmind/pysc2
benelot/pybullet-gym
Healthcare-Robotics/assistive-gym
Microsoft/malmo
nadavbh12/Retro-Learning-Environment
twitter/torch-twrl
duckietown/gym-duckietown
arex18/rocket-lander
ppaquette/gym-doom
eleurent/highway-env
thedimlebowski/Trading-Gym
denisyarats/dmc2gym
minerllabs/minerl
eugenevinitsky/sequential_social_dilemma_games
facebookresearch/minihack
UtkarshMishra04/bioimitation-gym
stanfordnmbl/osim-rl
upb-lea/gym-electric-motor
upb-lea/openmodelica-microgrid-gym
tobirohrer/building-energy-storage-simulation
intelligent-environments-lab/CityLearn
koulanurag/ma-gym
magni84/gym_bandits
ThomasLecat/gym-bandit-environments
JKCooper2/gym-bandits
diegoalejogm/openai-k-armed-bandits
Phylliade/awesome-openai-gym-environments
Unity-Technologies/marathon-envs
Farama-Foundation/PettingZoo - Foundation/PettingZoo)](https://github.com/Farama-Foundation/PettingZoo/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/Farama-Foundation/PettingZoo?label=last%20update)
Farama-Foundation/MAgent2
Unity-Technologies/ml-agents
LucasAlegre/sumo-rl
Farama-Foundation/D4RL - berkeley/d4rl](https://github.com/rail-berkeley/d4rl)) [![GitHub stars](https://img.shields.io/github/stars/Farama-Foundation/D4RL)](https://github.com/Farama-Foundation/D4RL/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/Farama-Foundation/D4RL?label=last%20update)
google-research/rlds - research/rlds)](https://github.com/google-research/rlds/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/google-research/rlds?label=last%20update)
Farama-Foundation/Minari - Foundation/Kabuki](https://github.com/Farama-Foundation/Kabuki)) [![GitHub stars](https://img.shields.io/github/stars/Farama-Foundation/Minari)](https://github.com/Farama-Foundation/Minari/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/Farama-Foundation/Minari?label=last%20update)
google-research/robel - research/robel)](https://github.com/google-research/robel/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/google-research/robel?label=last%20update)
rlworkgroup/garage - commit/rlworkgroup/garage?label=last%20update)
stepjam/RLBench - scale benchmark and learning environment." [![GitHub stars](https://img.shields.io/github/stars/stepjam/RLBench)](https://github.com/stepjam/RLBench/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/stepjam/RLBench?label=last%20update)
google-research/rliable - research/rliable)](https://github.com/google-research/rliable/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/google-research/rliable?label=last%20update)
google-research/rl-reliability-metrics - research/rl-reliability-metrics)](https://github.com/google-research/rl-reliability-metrics/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/google-research/rl-reliability-metrics?label=last%20update)
HYDesmondLiu/B2RL - commit/HYDesmondLiu/B2RL?label=last%20update)
deepmind/bsuite - designed experiments that investigate core capabilities of a reinforcement learning (RL) agent" [![GitHub stars](https://img.shields.io/github/stars/deepmind/bsuite)](https://github.com/deepmind/bsuite/stargazers) ![GitHub last commit](https://img.shields.io/github/last-commit/deepmind/bsuite?label=last%20update)
CMA-ES/pycma
hardmaru/estool
CyberAgent/cmaes
tigerneil/awesome-deep-rl
kengz/awesome-deep-rl
williamd4112/awesome-deep-reinforcement-learning
terryum/awesome-deep-learning-papers#new-papers
hanjuku-kaso/awesome-offline-rl
wwxFromTju/awesome-reinforcement-learning-lib

Programming Languages

Python 89 Jupyter Notebook 11 C++ 9 Lua 2 C# 2 Java 2 C 2 JavaScript 2 Scala 1 TeX 1

Keywords

reinforcement-learning 49 machine-learning 33 deep-learning 30 pytorch 18 python 18 gym 14 openai-gym 12 robotics 11 deep-reinforcement-learning 11 neural-network 10 tensorflow 10 rl 7 dqn 6 gymnasium 6 ml 5 ai 4 torch 4 deep-neural-networks 4 simulation 4 jax 4 openai 4 sac 3 ppo 3 gym-environment 3 mujoco 3 awesome-list 3 research 3 a2c 3 keras 3 neural-networks 3 atari 3 computer-vision 3 chainer 3 offline-reinforcement-learning 3 actor-critic 3 rl-algorithms 3 pybullet 3 control 3 openai-gym-environments 3 gpu 2 benchmark 2 d4rl 2 distributed-computing 2 ddpg 2 soft-actor-critic 2 caffe2 2 double-dqn 2 mxnet 2 caffe 2 drl 2