Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-deep-reinforcement-learning
Curated list for Deep Reinforcement Learning (DRL): software frameworks, models, datasets, gyms, baselines...
https://github.com/jgvictores/awesome-deep-reinforcement-learning
- scikit-learn
- scikit-image
- microsoft/DirectML
- safari
- presentation - deep-reinforcement-learning/blob/143a885cc10b4331b9b3fa3e1a9436d5325676af/doc/inria2017DLFrameworks.pdf)).
- 1 - docker), [3](https://github.com/bethgelab/docker-deeplearning).
- pytorch/pytorch
- keras-team/keras
- keras - learning-python), [2](https://elitedatascience.com/keras-tutorial-deep-learning-in-python)
- safari
- safari
- tensorflow/tensorflow
- flashlight/flashlight
- https://github.com/janhuenermann/neurojs
- ONNX
- OpenCV
- Chainer
- DALI - A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
- Sonnet
- MXNet
- Darknet
- ml5
- DL4J
- oneapi-src/oneDNN
- sony/nnabla
- Torch
- jittor
- PaddlePaddle
- CoreML - C) (support: Apple)
- GitHub
- GitHub
- GitHub
- GitHub
- OpenNN
- PyBrain
- Caffe
- MILA stopped developing
- facebookresearch/Detectron
- arxiv - fcis).
- arxiv - freiburg.de/people/ronneber/u-net/).
- arxiv
- arxiv - Single-Shot-MultiBox-Detector)
- arxiv
- arxiv - brief-history-of-cnns-in-image-segmentation-from-r-cnn-to-mask-r-cnn-34ea83205de4)): Fast R-CNN, Faster R-CNN, Mask R-CNN.
- arxiv
- arxiv
- arxiv
- arxiv
- arxiv - 3 weeks.
- arxiv - 7 million parameters, via smaller convs. A more aggressive cropping approach than that of Krizhevsky. Batch normalization, image distortions, RMSprop. Uses 9 novel "Inception modules" (at each layer of a traditional ConvNet, you have to make a choice of whether to have a pooling operation or a conv operation as well as the choice of filter size; an Inception module performs all these operations in parallel), and no fully connected layers. Trained on CPU (estimated as weeks via GPU), implemented in DistBelief (closed-source predecessor of TensorFlow). Variants ([summary](https://towardsdatascience.com/a-simple-guide-to-the-versions-of-the-inception-network-7fc52b863202)): v1, v2, v4, resnet v1, resnet v2; v9 ([slides](http://lsun.cs.princeton.edu/slides/Christian.pdf)). Also see the [Xception (2017)](https://arxiv.org/pdf/1610.02357.pdf) paper.
- arxiv
- doi - justified finer tuning and visualization (namely Deconvolutional Network).
- doi - 61 million parameters, split into 2 pipelines to enable 5-6 day GTX 580 GPU training (with data augmentation running on the CPU).
- doi
- thunlp/GNNPapers
- Geometric deep learning
- chihming/awesome-network-embedding
- DLG
- pytorch
- tensorflow/gnn
- pytorch
- ref
- caffe
- tensorflow - tutorial-fine-tuning-using-pre-trained-models/)
- arxiv - painterly-harmonization)
- arxiv - photo-styletransfer)
- arxiv - style), keras [1](https://github.com/keras-team/keras/blob/master/examples/neural_style_transfer.py) [2](https://github.com/titu1994/Neural-Style-Transfer) [3](https://github.com/handong1587/handong1587.github.io/blob/master/_posts/deep_learning/2015-10-09-fun-with-deep-learning.md) [4](https://medium.com/mlreview/making-ai-art-with-style-transfer-using-keras-8bb5fa44b216)
- hindupuravinash/the-gan-zoo
- arxiv - pytorch)
- arxiv
- arxiv - Adversarial-Networks)
- CycleGAN - Yan Zhu et al.; Berkeley; "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks". [torch](https://github.com/junyanz/CycleGAN), later migrated to [pytorch](https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix).
- arxiv
- arxiv
- FFTNet - Time Speaker-Dependent Neural Vocoder". [pytorch](https://github.com/mozilla/FFTNet)
- WaveNet
- keras
- word2vec
- keras - image-similarity)
- arxiv
- wikipedia
- awesomedata/awesome-public-datasets
- MIT Places
- MNIST - MNIST](https://github.com/rois-codh/kmnist).
- ImageNet
- PASCAL VOC
- CIFAR-10
- CIFAR-100
- MIT MM Stimuli
- SVHN
- HICO
- Visual Genome
- COCO
- Quick Draw (Google)
- iCubWorld - camera-dataset](https://github.com/muratkrty/iCub-camera-dataset).
- Kinetics (DeepMind)
- HowTo100M
- text8
- UMICH SI650
- wikipedia
- DOI: 10.1145/3447526.3472059
- brain-research/realistic-ssl-evaluation
- keras web - team/keras/tree/master/keras/applications), [keras 2](https://github.com/keras-team/keras-applications), [pytorch](https://pytorch.org/docs/stable/torchvision/models.html), [caffe](https://github.com/BVLC/caffe/wiki/Model-Zoo), [ONNX](https://github.com/onnx/models) (pytorch/caffe2).
- keras
- keras - 10 weights](https://drive.google.com/open?id=0B4odNGNGJ56qVW9JdkthbzBsX28) / [keras CIFAR-100 weights](https://drive.google.com/open?id=0B4odNGNGJ56qTEdnT1RjTU44Zms)
- keras by keras - team/keras/tree/e15533e6c725dca8c37a861aacb13ef149789433/keras/applications)) / [keras by kaggle](https://www.kaggle.com/keras) / [pytorch by kaggle](https://www.kaggle.com/pytorch)
- keras
- keras
- caffe by original VGG author
- gensim
- keras
- wikipedia - activations/), [ref](https://towardsdatascience.com/deep-study-of-a-not-very-deep-neural-network-part-2-activation-functions-fd9bd8d406fc).
- keras
- keras
- facebookresearch/nevergrad
- keras
- wikipedia
- keras
- wikipedia - validation).
- tensorflow - tutorial-fine-tuning-using-pre-trained-models/)
- tensorboard
- tensorboardX
- keras - deep-learning-neural-network-model-keras/), [2](https://github.com/keplr-io/quiver), [3](https://raghakot.github.io/keras-vis/), [4](https://www.kaggle.com/amarjeet007/visualize-cnn-with-keras)
- tensorflow online demo
- loss-landscape
- netscope
- slundberg/shap
- EthicalML/xai
- Reinforcement Learning Specialization - 20). Note that another major separation is off/on policy RL algorithms. DRL methods would fit into function approximators.
- Deep Reinforcement Learning CS 285 at UC Berkeley - fa20/), Lecture 4.
- Part 2: Kinds of RL Algorithms - Rendered from <https://github.com/openai/spinningup/blob/038665d62d569055401d91856abb287263096178/docs/spinningup/rl_intro2.rst>
- ray-project/ray (Ray total) (also covers multiagent)
- Unity-Technologies/ml-agents
- google/dopamine
- keras-rl/keras-rl
- SoyGema/Startcraft_pysc2_minigames
- thu-ml/tianshou
- DLR-RM/stable-baselines3 - [hill-a/stable-baselines](https://github.com/hill-a/stable-baselines) fork of [openai/baselines](https://github.com/openai/baselines)
- https://github.com/janhuenermann/neurojs
- deepmind/open_spiel
- reinforceio/tensorforce
- deepmind/trfl
- catalyst-team/catalyst
- deepmind/acme
- rll/rllab
- tensorflow/agents
- astooke/rlpyt
- rail-berkeley/rlkit
- vwxyzjn/cleanrl
- oxwhirl/pymarl - agent reinforcement learning
- deepmind/bsuite
- chainer/chainerrl
- facebookresearch/rl
- MushroomRL/mushroom-rl
- SurrealAI/surreal
- medipixel/rl_algorithms
- ikostrikov/jaxrl2
- ikostrikov/jaxrl
- tinkoff-ai/CORL - quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC"
- learnables/cherry
- trackmania-rl/tmrl
- ethanluoyc/magi
- RL-Glue - glue-ext/wikis/RLGlueCore.wiki)) (API: C/C++, Java, Matlab, Python, Lisp) (support: Alberta)
- tensorflow/tensorflow
- pytorch/pytorch
- keras-team/keras
- google/jax
- facebookresearch/mbrl-lib
- haarnoja/sac
- Asap7772/PTR - Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning
- ikostrikov/pytorch-a2c-ppo-acktr
- openai/spinningup
- qfettes/DeepRL-Tutorials
- Farama-Foundation/Gymnasium. ~~DEPRECATED: [openai/gym](https://github.com/openai/gym), <https://gym.openai.com>, <https://gym.openai.com/docs/>~~
- Farama-Foundation/Gymnasium-Robotics
- Farama-Foundation/Minigrid
- Farama-Foundation/MiniWorld
- Farama-Foundation/ViZDoom
- openai/roboschool
- Unity-Technologies/ml-agents
- Unity-Technologies/obstacle-tower-env
- LucasAlegre/sumo-rl
- qgallouedec/panda-gym
- NVIDIA-Omniverse/IsaacGymEnvs
- leggedrobotics/legged_gym
- osudrl/cassie-mujoco-sim
- utiasDSL/safe-control-gym
- deepmind/bsuite
- openai/gym-soccer
- erlerobot/gym-gazebo
- robotology/gym-ignition
- dartsim/gym-dart
- Roboy/gym-roboy
- kngwyu/mujoco-maze
- Improbable-AI/walk-these-ways
- ucuapps/modelicagym
- openai/safety-gym
- openai/retro
- deepmind/pysc2
- benelot/pybullet-gym
- Healthcare-Robotics/assistive-gym
- Microsoft/malmo
- nadavbh12/Retro-Learning-Environment
- twitter/torch-twrl
- duckietown/gym-duckietown
- arex18/rocket-lander
- ppaquette/gym-doom
- eleurent/highway-env
- thedimlebowski/Trading-Gym
- denisyarats/dmc2gym
- minerllabs/minerl
- eugenevinitsky/sequential_social_dilemma_games
- facebookresearch/minihack
- UtkarshMishra04/bioimitation-gym
- stanfordnmbl/osim-rl
- upb-lea/gym-electric-motor
- upb-lea/openmodelica-microgrid-gym
- tobirohrer/building-energy-storage-simulation
- intelligent-environments-lab/CityLearn
- koulanurag/ma-gym
- magni84/gym_bandits
- ThomasLecat/gym-bandit-environments
- JKCooper2/gym-bandits
- diegoalejogm/openai-k-armed-bandits
- Phylliade/awesome-openai-gym-environments
- Unity-Technologies/marathon-envs
- Farama-Foundation/PettingZoo
- Farama-Foundation/MAgent2
- Unity-Technologies/ml-agents
- LucasAlegre/sumo-rl
- Farama-Foundation/D4RL - [rail-berkeley/d4rl](https://github.com/rail-berkeley/d4rl)
- google-research/rlds
- Farama-Foundation/Minari - [Farama-Foundation/Kabuki](https://github.com/Farama-Foundation/Kabuki)
- google-research/robel
- rlworkgroup/garage
- stepjam/RLBench - scale benchmark and learning environment."
- google-research/rliable
- google-research/rl-reliability-metrics
- HYDesmondLiu/B2RL
- deepmind/bsuite - designed experiments that investigate core capabilities of a reinforcement learning (RL) agent"
- CMA-ES/pycma
- hardmaru/estool
- CyberAgent/cmaes
- tigerneil/awesome-deep-rl
- kengz/awesome-deep-rl
- williamd4112/awesome-deep-reinforcement-learning
- terryum/awesome-deep-learning-papers#new-papers
- hanjuku-kaso/awesome-offline-rl
- wwxFromTju/awesome-reinforcement-learning-lib
Keywords
- reinforcement-learning: 49
- machine-learning: 33
- deep-learning: 30
- pytorch: 18
- python: 18
- gym: 14
- openai-gym: 12
- robotics: 11
- deep-reinforcement-learning: 11
- neural-network: 10
- tensorflow: 10
- rl: 7
- dqn: 6
- gymnasium: 6
- ml: 5
- ai: 4
- torch: 4
- deep-neural-networks: 4
- simulation: 4
- jax: 4
- openai: 4
- sac: 3
- ppo: 3
- gym-environment: 3
- mujoco: 3
- awesome-list: 3
- research: 3
- a2c: 3
- keras: 3
- neural-networks: 3
- atari: 3
- computer-vision: 3
- chainer: 3
- offline-reinforcement-learning: 3
- actor-critic: 3
- rl-algorithms: 3
- pybullet: 3
- control: 3
- openai-gym-environments: 3
- gpu: 2
- benchmark: 2
- d4rl: 2
- distributed-computing: 2
- ddpg: 2
- soft-actor-critic: 2
- caffe2: 2
- double-dqn: 2
- mxnet: 2
- caffe: 2
- drl: 2