Projects in Awesome Lists tagged with rl

https://github.com/google/dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

ai google ml rl tensorflow

Last synced: 16 Dec 2024

https://google.github.io/dopamine/

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

ai google ml rl tensorflow

Last synced: 12 Nov 2024

https://github.com/thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

a2c atari bcq cql ddpg double-dqn dqn drl imitation-learning mujoco npg policy-gradient ppo pytorch rl sac td3 transferlab trpo

Last synced: 16 Dec 2024

https://github.com/pytorch/elf

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

alpha-zero alphago-zero go reinforcement-learning rl rl-environment

Last synced: 26 Sep 2024

https://github.com/junxiaosong/alphazero_gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

alphago alphago-zero alphazero board-game gobang gomoku mcts monte-carlo-tree-search pytorch reinforcement-learning rl self-learning tensorflow

Last synced: 19 Dec 2024

https://github.com/werner-duvaud/muzero-general

MuZero

alphago alphazero deep-learning deep-reinforcement-learning gym machine-learning mcts model-based-rl monte-carlo-tree-search muzero muzero-general neural-network python3 pytorch reinforcement-learning residual-network rl self-learning tensorboard

Last synced: 18 Dec 2024

https://github.com/pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

ai control decision-making distributed-computing machine-learning marl model-based-reinforcement-learning multi-agent-reinforcement-learning pytorch reinforcement-learning rl robotics torch

Last synced: 21 Dec 2024

https://github.com/IntelLabs/coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art reinforcement learning algorithms.

carla coach deep-learning distributed-reinforcement-learning hierarchical-reinforcement-learning imitation-learning mujoco mxnet onnx openai-gym reinforcement-learning rl roboschool starcraft starcraft2 starcraft2-ai tensorflow

Last synced: 31 Oct 2024

https://github.com/dlr-rm/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

deep-reinforcement-learning gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning lab openai optimization pybullet pybullet-environments pytorch reinforcement-learning rl robotics sde stable-baselines tuning-hyperparameters

Last synced: 21 Dec 2024

https://github.com/pathak22/noreward-rl

[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning

curiosity deep-learning deep-neural-networks deep-reinforcement-learning doom exploration mario openai-gym rl self-supervised tensorflow

Last synced: 15 Dec 2024

https://github.com/maximevandegar/papers-in-100-lines-of-code

Implementation of papers in 100 lines of code.

3d aes artificial-intelligence deep-learning diffusion-models educational gans generative-model implementation-of-research-paper inverse-rendering machine-learning meta-learning nerf neural-radiance-fields papers python pytorch reinforcement-learning research rl

Last synced: 20 Dec 2024

https://github.com/araffin/rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning hyperparameters openai openai-gym optimization pybullet reinforcement-learning rl stable-baselines zoo

Last synced: 15 Dec 2024

https://github.com/erlerobot/gym-gazebo

Refer to https://github.com/AcutronicRobotics/gym-gazebo2 for the new version

deep-reinforcement-learning drl gazebo openai-gym reinforcement-learning rl robotics ros

Last synced: 13 Nov 2024

https://github.com/google-research/seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

atari deepmind-lab gcp google-research-football impala r2d2 rl tf2

Last synced: 27 Oct 2024

https://github.com/MushroomRL/mushroom-rl

Python library for Reinforcement Learning.

atari ddpg deep-learning deep-reinforcement-learning dqn mujoco openai-gym pybullet pytorch qlearning reinforcement-learning rl sac trpo

Last synced: 02 Nov 2024

https://github.com/google-research/rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

benchmarking evaluation-metrics google machine-learning reinforcement-learning rl

Last synced: 15 Dec 2024

https://github.com/utilforever/rosettastone

Hearthstone simulator using C++ with some reinforcement learning

cplusplus cpp cpp17 hearthstone hearthstone-api hearthstone-simulator python-api reinforcement-learning rl rl-environment simulator-game

Last synced: 21 Dec 2024

https://github.com/araffin/rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

colab-notebook notebook python reinforcement-learning rl stable-baselines tutorial

Last synced: 20 Dec 2024

https://github.com/ashishpatel26/real-time-ml-project

A curated list of applied machine learning and data science notebooks and libraries across different industries.

application deep-learning deeplearning dl keras machine-learning machine-learning-algorithms machinelearning ml ml-application project pytorch real-time real-time-data rl tensorflow theano

Last synced: 18 Dec 2024

https://github.com/neptune-ai/neptune-client

📘 The experiment tracker for foundation model training

comparison dl foundation keras learning lightgbm llm logger logging machine ml mlops monitoring optuna pytorch rl tensorflow versioning visualization xgboost

Last synced: 18 Dec 2024

https://github.com/chenglongchen/pytorch-drl

PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.

a2c acktr actor-critic advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network deep-reinforcement-learning dqn drl madrl multi-agent ppo proximal-policy-optimization pytorch reinforcement-learning rl

Last synced: 16 Dec 2024

https://github.com/Toni-SM/skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

deep-learning deepmind gym gymnasium isaac-gym isaac-lab isaac-orbit isaac-sim isaaclab jax machine-learning nvidia-omniverse openai-gym python pytorch reinforcement-learning rl robosuite robotics skrl

Last synced: 03 Nov 2024

https://github.com/peteanderson80/matterport3dsimulator

AI Research Platform for Reinforcement Learning from Real Panoramic Images.

matterport3d-dataset matterport3d-simulator natural-language-processing reinforcement-learning rl simulator vision-and-language

Last synced: 21 Dec 2024

https://github.com/AcutronicRobotics/gym-gazebo2

gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo

deep-reinforcement-learning drl gazebo gym reinforcement-learning rl robotics ros ros2

Last synced: 02 Nov 2024

https://github.com/wil3/gymfc

A universal flight control tuning framework

benchmark drone flight-controller gazebo gazebo-plugin gazebo-simulator machinelearning openai openai-gym openai-gym-environments quadcopter reinforcement-learning rl robotics uav

Last synced: 15 Dec 2024

https://github.com/mishalaskin/rad

RAD: Reinforcement Learning with Augmented Data

codebase data- data-augmentations deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deeplearning-ai dm-control model-free mujoc off-policy ppo rad reinforcement-learning rl sac soft-actor-critic

Last synced: 16 Dec 2024

https://github.com/denisyarats/drq

DrQ: Data regularized Q

actor-critic control data-augmentation deep-learning deep-reinforcement-learning dm-control drq gym model-free mujoco off-policy pixel python pytorch reinforcement-learning rl sac soft-actor-crit

Last synced: 13 Nov 2024

https://github.com/lucasalegre/morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

gym gymnasium mo-gymnasium morl multi-objective multi-objective-rl pytorch reinforcement-learning rl rl-algorithms

Last synced: 21 Dec 2024

https://github.com/araffin/learning-to-drive-in-5-minutes

Implementation of a reinforcement learning approach to make a car learn to drive smoothly in minutes

donkey-car gym openai reinforcement-learning rl sac self-driving-car simulator soft-actor-critic srl stable-baselines state-representation-learning unity vae

Last synced: 27 Nov 2024

https://github.com/princeton-nlp/webshop

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

decision-making language language-grounding ml nlp rl rl-environment shopping sim-to-real web-based

Last synced: 16 Dec 2024

https://github.com/bentrevett/pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

a2c actor-critic advantage-actor-critic generalized-advantage-estimation policy-gradient pytorch pytorch-implementation pytorch-implmention pytorch-rl pytorch-tutorial pytorch-tutorials reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 30 Oct 2024

https://github.com/gsurma/atari

AI research environment for the Atari 2600 games 🤖.

ai artificial-intelligence atari breakout ddqn dqn gym machine-learning ml python python2 q-l reinforcement-learning rl space-invaders

Last synced: 17 Dec 2024

https://github.com/bakkesmodorg/bakkesmodsdk

The current BakkesModSDK (Unofficial SDK for Rocket League)

api game league mod modding plugin plugins rl rocket rocket-league sdk unofficial

Last synced: 20 Dec 2024

https://github.com/learnables/cherry

A PyTorch Library for Reinforcement Learning Research

learning pytorch reinforcement reinforcement-learning rl

Last synced: 20 Dec 2024

https://github.com/Draichi/cryptocurrency_prediction

⚡ ⚡ Deep RL Algotrading with Ray API

algotrading bot ray reinforcement-learning-bot rl rllib trading trading-bot

Last synced: 18 Dec 2024

https://github.com/Draichi/T-1000

⚡ ⚡ Deep RL Algotrading with Ray API

algotrading bot ray reinforcement-learning-bot rl rllib trading trading-bot

Last synced: 01 Nov 2024

https://github.com/utilforever/baba-is-auto

Baba Is You simulator using C++ with some reinforcement learning

baba-is-you babaisyou cplusplus cpp cpp17 python-api reinforcement-learning rl rl-environment simulator-game

Last synced: 28 Nov 2024

https://github.com/facebookresearch/benchmarl

A collection of MARL benchmarks based on TorchRL

benchmark machine-learning marl multi-agent multi-agent-reinforcement-learning pytorch reinforcement-learning rl robotics torch

Last synced: 16 Dec 2024

https://github.com/gordicaleksa/pytorch-learn-reinforcement-learning

A collection of various RL algorithms such as policy gradients, DQN and PPO. The goal of this repo is to become a go-to resource for learning about RL: how to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI Gym, etc.

deep-learning deep-q-network dqn jupyter policy-gradient ppo python pytorch pytorch-dqn pytorch-implementation pytorch-policy-gradient pytorch-ppo reinforcement-learning reinforcement-learning-algorithms rl

Last synced: 11 Nov 2024

https://github.com/mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

alignment diffusion reinforcement-learning reinforcement-learning-human-feedback rl rlhf vader video-diffusion video-diffusion-alignment

Last synced: 31 Oct 2024

https://github.com/pathak22/exploration-by-disagreement

[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement

artificial-curiosity artificial-intelligence curiosity deep-learning deep-reinforcement-learning exploration rl self-supervised tensorflow

Last synced: 14 Nov 2024

https://github.com/chendrag/mujoco-benchmark

Provides a full reinforcement learning benchmark on MuJoCo environments, including DDPG, SAC, TD3, PG, A2C, PPO, library

baseline benchmark ddpg drl mujoco performance ppo pytorch results rl sac tianshou

Last synced: 27 Oct 2024

https://github.com/toshas/torch-discounted-cumsum

Fast Discounted Cumulative Sums in PyTorch

discounted-cumulative-sum pytorch reinforce reinforcement-learning rl

Last synced: 16 Nov 2024

https://github.com/opendilab/generativerl

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

diffusion diffusion-models diffusion-policy flow-model generative-ai generative-model offline-rl reinforcement-learning rl

Last synced: 15 Dec 2024

https://github.com/shuaibinli/rl_carla

Train auto_car in the CARLA simulator with RL algorithms (SAC).

carla rl

Last synced: 07 Nov 2024

https://github.com/cy69855522/ai-paper-drawer

A collection of key points from AI papers. This project aims to collect key points of AI papers.

ai-papers cv deep-learning gans gcn gnn graph nlp rl

Last synced: 01 Dec 2024

https://github.com/gsurma/slitherin

AI research environment for the game of Snake 🐍 .

ai bfs dfs dnn genetic-algorithm genetic-algorithms gym hamiltonian longest-path monte-carlo openai openai-gym python python27 requests-for-research rl slitherin-gym snake snake-game

Last synced: 03 Dec 2024

https://github.com/mymusise/trading-gym

A trading environment based on Gym

drl gym python3 reinforcement-learning rl trading

Last synced: 16 Nov 2024

https://github.com/maluuba/hra

Hybrid Reward Architecture

reinforcement-learning rl

Last synced: 16 Nov 2024

https://github.com/stillonearth/bevy_rl

Reinforcement Learning environments with Bevy

bevy gym rl

Last synced: 20 Dec 2024

https://github.com/cloud-cv/evalai-starters

How to create a challenge on EvalAI?

agent ai cv data-science data-science-competition environments evalai get-started getting-started ml reinforcement-learning rl

Last synced: 18 Dec 2024

https://github.com/MathisWellmann/gym-rs

OpenAI's Gym written in pure Rust for blazingly fast performance

ai ml openai-gym reinforcement-learning rl

Last synced: 02 Nov 2024

https://github.com/losttech/gradient-samples

Samples for TensorFlow binding for .NET by Lost Tech

cnn cross-platform csharp deep-learning dotnet fsharp gpt-2 lstm reinforcement-learning resnet rl tensorflow tensorflow-binding tensorflow-examples tensorflow-tutorials unity-ml unity-ml-agents

Last synced: 12 Nov 2024

https://github.com/notedance/note

Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch

artificial-intelligence deep-learning deep-reinforcement-learning deeplearning deepreinforcementlearning distributed-training dl drl machine-learning machine-learning-library machinelearning ml neural-network neuralnetwork parallel-training pytorch reinforcement-learning reinforcementlearning rl tensorflow

Last synced: 19 Dec 2024

https://github.com/princeton-nlp/calm-textgame

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

calm gpt n-gram nlp rl text-based-game

Last synced: 11 Nov 2024

https://github.com/traffic-alpha/illm-tsc

This repository contains the code for the paper "iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement"

llm reinforcement-learning rl tsc

Last synced: 20 Dec 2024

https://github.com/blackhc/mdp

Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.

mdp openai-gym rl

Last synced: 09 Nov 2024

https://github.com/LucasWaelti/RL_Webots

Webots project to show how to use Deep Reinforcement Learning with Webots in C++.

cpp deep-reinforcement-learning libtorch policy-gradient python pytorch rl webots

Last synced: 16 Nov 2024

https://github.com/bytedance/raylink

Framework to build and train RL algorithms

reinforcement-learning rl

Last synced: 15 Nov 2024

https://github.com/sintefneodroid/droid

Package for rapid prototyping of reinforcement learning environments 🚀

agent blazing deep-learning droid fast hacktoberfest learning-agents machine-learning ml motor neo neodroid neural-network prototyping reinforcement-learning rl segment-images simulation unity

Last synced: 07 Nov 2024

https://github.com/thu-ml/srpo

Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).

behavior-regularization d4rl diffusion generative offline reinforcement-learning rl score-based-models srpo

Last synced: 13 Nov 2024

https://github.com/crumblyliquid/bakkeslinux

Guide for running BakkesMod on Linux

bakkes bakkesmod league linux mod rl rocket rocket-league training

Last synced: 09 Oct 2024

https://github.com/ahmetfurkandemir/supermariobrosrl

Super Mario Bros training with Ray RLlib DQN algorithm

dqn-tensorflow keras ray reinforcement-learning rl rllib tensorflow

Last synced: 16 Nov 2024

https://github.com/en10/cartpole

Run OpenAI Gym on a Server

aws cartpole gym keras openai openai-gym reinforcement-learning rl

Last synced: 05 Dec 2024

https://github.com/jianzhnie/rltoolkit

RLToolkit is a flexible and highly efficient reinforcement learning framework. It includes implementations of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

a2c actor-critic ddpg ddqn dqn maddpg mappo ppo qmix rl sac td3 trpo

Last synced: 06 Dec 2024

https://github.com/utilforever/corailed

Unrailed! simulator using C++ with some reinforcement learning and Unrailed! AI using Python with OpenCV

cplusplus cpp cpp17 python python-api python3 reinforcement-learning rl rl-environment simulator-game unrailed

Last synced: 28 Nov 2024

https://github.com/thomas-yanxin/useful_information

A record of resources and information I personally find useful. [Contributions welcome!]

backend cv datawhale gui latex paddleclas paddlenlp paddleocr paddlepaddle paddleseg pyqt5 rl

Last synced: 06 Dec 2024

https://github.com/modanesh/anomalous_rl_envs

Anomalous versions of OpenAI Gym and PyBullet3 environments

anomaly-detection openai-gym pybullet reinforcement-learning rl

Last synced: 24 Nov 2024

https://github.com/fer14/raice

Car racing RL agents on actual F1 tracks

cars dqn f1 neat policy-gradient reinforcement-learning rl sarsa

Last synced: 21 Nov 2024

https://github.com/timcsy/gymize

Unity and Python Reinforcement and Imitation Learning with Gymnasium and PettingZoo API.

3d gym gymnasium imitation-learning pettingzoo reinforcement-learning rl unity

Last synced: 10 Oct 2024

https://github.com/hyperplane-lab/rgbmanip

Official implementation of RGBManip (ICRA2024)

monocular-depth-estimation rl robotics

Last synced: 17 Nov 2024

https://github.com/kirili4ik/hrl-taxi

Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)

hierarchical-reinforcement-learning maxq python3 rl

Last synced: 19 Nov 2024

https://github.com/modanesh/differential_ig

Source code for the differential saliency method used in "Re-understanding Finite-State Representations of Recurrent Policy Networks"

pytorch rl saliency xai

Last synced: 24 Nov 2024

https://github.com/princeton-nlp/blindfold-textgame

[NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents

naacl naacl2021 natural-language-processing nlp reinforcement-learning rl text-based-game text-game

Last synced: 11 Nov 2024

https://github.com/salesforce/gaea

Data and code for the Salesforce Research paper GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning (https://arxiv.org/abs/2012.03900). The paper provides methods for constraint graph augmentation and optimal facility placement problems.

ai constraint-optimization deep-reinforcement-learning equity fairness-ai graph-algorithms graph-machine-learning graph-ml ml reinforcement-learning resource-management rl social-network social-network-analysis