Projects in Awesome Lists tagged with reinforcement-learning

https://github.com/MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

book courses machine-learning reinforcement-learning tutorials

Last synced: 08 Aug 2024

https://github.com/farama-foundation/pettingzoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

api gym gymnasium multi-agent-reinforcement-learning multiagent-reinforcement-learning reinforcement-learning

Last synced: 29 Sep 2024

https://github.com/opendilab/DI-engine

OpenDILab Decision AI Engine

atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac

Last synced: 01 Aug 2024

https://github.com/open-spaced-repetition/fsrs4anki

A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm

anki anki-addon deep-learning fsrs intelligent-tutoring-system machine-learning memory optimal-control reinforcement-learning spaced-repetition spaced-repetition-algorithm srs

Last synced: 30 Sep 2024

https://github.com/ashishpatel26/andrew-ng-notes

This is Andrew NG Coursera Handwritten Notes.

andrew-ng andrew-ng-course andrew-ng-machine-learning andrewng coursera coursera-machine-learning data-science deep-learning deep-neural-networks dl machine-learning ml neural-network neural-networks numpy pandas python pytorch reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/Farama-Foundation/PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

api gym gymnasium multi-agent-reinforcement-learning multiagent-reinforcement-learning reinforcement-learning

Last synced: 01 Aug 2024

https://github.com/jayinai/data-science-question-answer

A repo for data science related questions and answers

data-science deep-learning machine-learning reinforcement-learning sql statistics system

Last synced: 30 Sep 2024

https://github.com/werner-duvaud/muzero-general

MuZero

alphago alphazero deep-learning deep-reinforcement-learning gym machine-learning mcts model-based-rl monte-carlo-tree-search muzero muzero-general neural-network python3 pytorch reinforcement-learning residual-network rl self-learning tensorboard

Last synced: 01 Oct 2024

https://github.com/eleurent/highway-env

A minimalist environment for decision-making in autonomous driving

autonomous-driving gym-environment reinforcement-learning

Last synced: 13 Aug 2024

https://github.com/Farama-Foundation/HighwayEnv

A minimalist environment for decision-making in autonomous driving

autonomous-driving gym-environment reinforcement-learning

Last synced: 31 Jul 2024

https://github.com/farama-foundation/highwayenv

A minimalist environment for decision-making in autonomous driving

autonomous-driving gym-environment reinforcement-learning

Last synced: 01 Oct 2024

https://github.com/IntelLabs/coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

carla coach deep-learning distributed-reinforcement-learning hierarchical-reinforcement-learning imitation-learning mujoco mxnet onnx openai-gym reinforcement-learning rl roboschool starcraft starcraft2 starcraft2-ai tensorflow

Last synced: 31 Jul 2024

https://github.com/intellabs/coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

carla coach deep-learning distributed-reinforcement-learning hierarchical-reinforcement-learning imitation-learning mujoco mxnet onnx openai-gym reinforcement-learning rl roboschool starcraft starcraft2 starcraft2-ai tensorflow

Last synced: 27 Sep 2024

https://github.com/ashishpatel26/Andrew-NG-Notes

This is Andrew NG Coursera Handwritten Notes.

andrew-ng andrew-ng-course andrew-ng-machine-learning andrewng coursera coursera-machine-learning data-science deep-learning deep-neural-networks dl machine-learning ml neural-network neural-networks numpy pandas python pytorch reinforcement-learning

Last synced: 03 Aug 2024

https://github.com/lgsvl/simulator

A ROS/ROS2 Multi-robot Simulator for Autonomous Vehicles

3d airsim api artificial-intelligence autonomous autoware baidu carla computer-vision csharp deep-learning game-engine machine-learning reinforcement-learning ros self-driving-car simulator tensorflow unity unreal-engine

Last synced: 29 Sep 2024

https://github.com/hzwer/iccv2019-learningtopaint

ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning

computer-vision deep-learning painting pytorch reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/awjuliani/deeprl-agents

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

reinforcement-learning tensorflow

Last synced: 30 Sep 2024

https://github.com/hzwer/ICCV2019-LearningToPaint

ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning

computer-vision deep-learning painting pytorch reinforcement-learning

Last synced: 06 Aug 2024

https://github.com/awjuliani/DeepRL-Agents

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

reinforcement-learning tensorflow

Last synced: 30 Jul 2024

https://github.com/google-deepmind/mctx

Monte Carlo tree search in JAX

jax monte-carlo-tree-search reinforcement-learning

Last synced: 31 Jul 2024

https://github.com/deepmind/mctx

Monte Carlo tree search in JAX

jax monte-carlo-tree-search reinforcement-learning

Last synced: 04 Aug 2024

https://github.com/tirthajyoti/Papers-Literature-ML-DL-RL-AI

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

artificial-intelligence data-mining data-science deep-learning game-theory hardware learning-theory literature machine-learning machine-learning-algorithms neural-network paper pattern-recognition reinforcement-learning silicon statistical-learning statistics

Last synced: 31 Jul 2024

https://github.com/allenai/rl4lms

A modular RL library to fine-tune language models to human preferences

dialogue-generation language-modeling machine-translation natural-language-processing nlp reinforcement-learning summarization table-to-text text-generation

Last synced: 01 Oct 2024

https://github.com/zeta36/chess-alpha-zero

Chess reinforcement learning by AlphaGo Zero methods.

alphago-zero chess keras reinforcement-learning tensorflow

Last synced: 26 Sep 2024

https://github.com/harderthenharder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

information-extraction nlp reinforcement-learning text-classification text-generation text-matching transformers

Last synced: 30 Sep 2024

https://github.com/aminhp/gym-anytrading

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments

Last synced: 30 Sep 2024

https://github.com/allenai/RL4LMs

A modular RL library to fine-tune language models to human preferences

dialogue-generation language-modeling machine-translation natural-language-processing nlp reinforcement-learning summarization table-to-text text-generation

Last synced: 31 Jul 2024

https://github.com/openrlhf/openrlhf

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm

Last synced: 30 Sep 2024

https://github.com/Zeta36/chess-alpha-zero

Chess reinforcement learning by AlphaGo Zero methods.

alphago-zero chess keras reinforcement-learning tensorflow

Last synced: 30 Jul 2024

https://github.com/facebookresearch/elf

An End-To-End, Lightweight and Flexible Platform for Game Research

artificial-intelligence cpp deep-learning gaming neural-network platform python reinforcement-learning

Last synced: 25 Sep 2024

https://github.com/facebookresearch/ELF

An End-To-End, Lightweight and Flexible Platform for Game Research

artificial-intelligence cpp deep-learning gaming neural-network platform python reinforcement-learning

Last synced: 31 Jul 2024

https://github.com/google/brax

Massively parallel rigidbody physics simulation on accelerator hardware.

jax physics-simulation reinforcement-learning robotics

Last synced: 30 Sep 2024

https://github.com/girafe-ai/ml-course

Open Machine Learning course

computer-vision course deep-learning machine-learning materials natural-language-processing python pytorch reinforcement-learning seminars

Last synced: 30 Sep 2024

https://github.com/AminHP/gym-anytrading

The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)

dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments

Last synced: 02 Aug 2024

https://github.com/dlr-rm/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

deep-reinforcement-learning gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning lab openai optimization pybullet pybullet-environments pytorch reinforcement-learning rl robotics sde stable-baselines tuning-hyperparameters

Last synced: 30 Sep 2024

https://github.com/lywangpx/reinforcement-learning-2nd-edition-by-sutton-exercise-solutions

Solutions of Reinforcement Learning, An Introduction

exercise-solutions reinforcement-learning self-study solutions

Last synced: 30 Sep 2024

https://github.com/letianzj/quantresearch

Quantitative analysis, strategies and backtests

algorithmic-trading algotrading asset-allocation asset-management backtesting-trading-strategies backtests data-science deep-learning derivatives-pricing financial-analysis machine-learning pairs-trading portfolio-management quantitative-finance quantitative-trading reinforcement-learning risk-management statistical-arbitrage trading-algorithms trading-strategies

Last synced: 30 Sep 2024

https://github.com/LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

exercise-solutions reinforcement-learning self-study solutions

Last synced: 01 Aug 2024

https://github.com/DLR-RM/rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

deep-reinforcement-learning gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning lab openai optimization pybullet pybullet-environments pytorch reinforcement-learning rl robotics sde stable-baselines tuning-hyperparameters

Last synced: 02 Aug 2024

https://github.com/opendilab/ppoxfamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

course decision-intelligence deep-reinforcement-learning python reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/alessiodm/drl-zh

Deep Reinforcement Learning: Zero to Hero!

deep-learning deep-reinforcement-learning machine-learning reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/facebookresearch/habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

ai computer-vision deep-learning deep-reinforcement-learning python reinforcement-learning research robotics sim2real simulator

Last synced: 25 Sep 2024

https://github.com/OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm

Last synced: 01 Aug 2024

https://github.com/curt-park/rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

colab-notebook dqn gym-environment nbviewer pytorch rainbow reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

ai control decision-making distributed-computing machine-learning marl model-based-reinforcement-learning multi-agent-reinforcement-learning pytorch reinforcement-learning rl robotics torch

Last synced: 29 Sep 2024

https://github.com/packtpublishing/advanced-deep-learning-with-keras

Advanced Deep Learning with Keras, published by Packt

autoencoder deep-learning gan keras reinforcement-learning vae

Last synced: 26 Sep 2024

https://github.com/wassimtenachi/physo

Physical Symbolic Optimization

deep-learning equation-discovery machine-learning physics python reinforcement-learning symbolic-regression

Last synced: 30 Sep 2024

https://github.com/Curt-Park/rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

colab-notebook dqn gym-environment nbviewer pytorch rainbow reinforcement-learning

Last synced: 31 Jul 2024

https://github.com/WassimTenachi/PhySO

Physical Symbolic Optimization

deep-learning equation-discovery machine-learning physics python reinforcement-learning symbolic-regression

Last synced: 31 Jul 2024

https://github.com/yunyang1994/tensorflow2.0-examples

🙄 Difficult algorithm, Simple code.

convolutional-neural-network dcgan-tensorflow deep-learning deep-neural-networks fcn8s gan image-classification linear-regression machine-learning object-detection pix2pix reinforcement-learning resnet tensorflow tensorflow-examples tensorflow2 unet-image-segmentation vgg16 yolov3

Last synced: 26 Sep 2024

https://github.com/YunYang1994/TensorFlow2.0-Examples

🙄 Difficult algorithm, Simple code.

convolutional-neural-network dcgan-tensorflow deep-learning deep-neural-networks fcn8s gan image-classification linear-regression machine-learning object-detection pix2pix reinforcement-learning resnet tensorflow tensorflow-examples tensorflow2 unet-image-segmentation vgg16 yolov3

Last synced: 31 Jul 2024

https://github.com/geek-ai/magent

A Platform for Many-Agent Reinforcement Learning

deep-learning multi-agent reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/Farama-Foundation/ViZDoom

Reinforcement Learning environments based on the 1993 game Doom :godmode:

deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom

Last synced: 04 Aug 2024

https://github.com/geek-ai/MAgent

A Platform for Many-Agent Reinforcement Learning

deep-learning multi-agent reinforcement-learning

Last synced: 30 Jul 2024

https://github.com/farama-foundation/vizdoom

Reinforcement Learning environments based on the 1993 game Doom :godmode:

deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom

Last synced: 30 Sep 2024

https://github.com/mwydmuch/ViZDoom

Reinforcement Learning environments based on the 1993 game Doom :godmode:

deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom

Last synced: 01 Aug 2024

https://github.com/letianzj/QuantResearch

Quantitative analysis, strategies and backtests

algorithmic-trading algotrading asset-allocation asset-management backtesting-trading-strategies backtests data-science deep-learning derivatives-pricing financial-analysis machine-learning pairs-trading portfolio-management quantitative-finance quantitative-trading reinforcement-learning risk-management statistical-arbitrage trading-algorithms trading-strategies

Last synced: 01 Aug 2024

https://github.com/pytorch/tnt

A lightweight library for PyTorch training tools and utilities

deep-learning machine-learning neural-network python pytorch reinforcement-learning

Last synced: 29 Sep 2024

https://github.com/nikhilbarhate99/ppo-pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 30 Sep 2024

https://github.com/chuyangliu/snake

Artificial intelligence for the Snake game.

ai algorithm artificial-intelligence deep-reinforcement-learning game graph-theory python reinforcement-learning snake snake-ai

Last synced: 30 Sep 2024

https://github.com/uber-research/deep-neuroevolution

Deep Neuroevolution

ai deep-neuroevolution machine-learning reinforcement-learning

Last synced: 01 Aug 2024

https://github.com/yvictor/tradinggym

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

backtest backtesting-trading-strategies python reinforcement-learning trading trading-api trading-bot trading-platform trading-simulator trading-strategies

Last synced: 30 Sep 2024

https://github.com/nikhilbarhate99/PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms

Last synced: 02 Aug 2024

https://github.com/BindsNET/bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse

Last synced: 02 Aug 2024

https://github.com/bindsnet/bindsnet

Simulation of spiking neural networks (SNNs) using PyTorch.

dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse

Last synced: 25 Sep 2024

https://github.com/Yvictor/TradingGym

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

backtest backtesting-trading-strategies python reinforcement-learning trading trading-api trading-bot trading-platform trading-simulator trading-strategies

Last synced: 31 Jul 2024

https://github.com/TorchCraft/TorchCraft

Connecting Torch to StarCraft

bwapi deep-learning machine-learning reinforcement-learning starcraft torch torchcraft

Last synced: 30 Jul 2024

https://github.com/trademaster-ntu/trademaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:

finance fintech investment-strategies jupyter-notebook machine-learning python pytorch quantitative-trading reinforcement-learning stock-market trading-platform

Last synced: 26 Sep 2024

https://github.com/omarsar/nlp_overview

Overview of Modern Deep Learning Techniques Applied to Natural Language Processing

cnn deep-learning nlp reinforcement-learning rnn word-embeddings

Last synced: 30 Sep 2024

https://github.com/mossr/beautifulalgorithms.jl

Concise and beautiful algorithms written in Julia

algorithms decision-making-under-uncertainty julia machine-learning neural-network optimization quine regression reinforcement-learning sorting

Last synced: 30 Sep 2024

https://github.com/Ceruleanacg/Personae

📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.

paper reinforcement-learning stock stock-data stock-price-prediction supervised-learning time-series-prediction trading

Last synced: 31 Jul 2024

https://github.com/mossr/BeautifulAlgorithms.jl

Concise and beautiful algorithms written in Julia

algorithms decision-making-under-uncertainty julia machine-learning neural-network optimization quine regression reinforcement-learning sorting

Last synced: 31 Jul 2024

https://github.com/danijar/dreamerv3

Mastering Diverse Domains through World Models

artificial-intelligence general jax minecraft reinforcement-learning world-models

Last synced: 30 Sep 2024

https://github.com/keon/deep-q-learning

Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

ddqn deep-learning deep-q-network deep-reinforcement-learning dqn reinforcement-learning

Last synced: 26 Sep 2024

https://github.com/charlesXu86/Chatbot_CN

基于金融-司法领域(兼有闲聊性质)的聊天机器人，其中的主要模块有信息抽取、NLU、NLG、知识图谱等，并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口

attention-mechanism chatbot-cn deep-learning dialogue-systems django-restful intent-detection ir knowledge-graph ner nlg nlu oriented-dialogs recommendation reinforcement-learning sentiment-analysis slot-filling tenserflow-serving tensorflow text-classification text-correct

Last synced: 01 Aug 2024

https://github.com/arise-initiative/robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

physics-simulation reinforcement-learning robot-learning robot-manipulation robotics

Last synced: 01 Oct 2024

https://github.com/kengz/slm-lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 25 Sep 2024

https://github.com/TradeMaster-NTU/TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:

finance fintech investment-strategies jupyter-notebook machine-learning python pytorch quantitative-trading reinforcement-learning stock-market trading-platform

Last synced: 01 Aug 2024

https://github.com/ikatsov/tensor-house

A collection of reference Jupyter notebooks and demo AI/ML applications for enterprise use cases: marketing, pricing, supply chain, smart manufacturing, and more.

ai customer-analysis data-science deep-learning llm machine-learning marketing models personalization reinforcement-learning supply-chain

Last synced: 30 Sep 2024

https://github.com/kengz/SLM-Lab

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac

Last synced: 01 Aug 2024

https://github.com/microsoft/textworld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

reinforcement-learning text-based-adventure text-based-game

Last synced: 30 Sep 2024

https://github.com/rail-berkeley/softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

deep-learning deep-neural-networks deep-reinforcement-learning machine-learning reinforcement-learning soft-actor-critic

Last synced: 30 Sep 2024

https://github.com/morvanzhou/evolutionary-algorithm

Evolutionary Algorithm using Python, 莫烦Python 中文AI教学

distributed-es es evolution-strategies evolution-strategy evolutionary-algorithm genetic-algorithm machine-learning microbial-ga microbial-genetic-algorithm neat nes neural-nets neural-network neuroevolution openai python reinforcement-learning travel-sale-problem travel-sales-problem tutorial

Last synced: 30 Sep 2024

https://github.com/Microsoft/TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

reinforcement-learning text-based-adventure text-based-game

Last synced: 03 Aug 2024

https://github.com/microsoft/TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

reinforcement-learning text-based-adventure text-based-game

Last synced: 01 Aug 2024

https://github.com/chainer/chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.

actor-critic chainer deep-learning dqn machine-learning python reinforcement-learning

Last synced: 25 Sep 2024

https://github.com/sudharsan13296/Hands-On-Meta-Learning-With-Python

Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow

deep-meta-learning few-shot-learning keras maml mann matching-networks meta-imitation-learning meta-sgd metalearning ntm one-shot-learning prototypical-network prototypical-networks reinforcement-learning relation-network reptile shot-learning siamese-network tensorflow zero-shot-learning

Last synced: 02 Aug 2024

https://github.com/pku-alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna

Last synced: 27 Sep 2024

https://github.com/PKU-Alignment/safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna

Last synced: 03 Aug 2024

https://github.com/utiasDSL/gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

betaflight control crazyflie gym gymnasium multi-agent pybullet quadcopter quadrotor reinforcement-learning robotics sitl stable-baselines3 uav

Last synced: 31 Jul 2024

https://github.com/araffin/rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning hyperparameters openai openai-gym optimization pybullet reinforcement-learning rl stable-baselines zoo

Last synced: 03 Oct 2024

https://github.com/aitorzip/deepgtav

A plugin for GTAV that transforms it into a vision-based self-driving car research environment.

dataset-generation deep-learning gtav reinforcement-learning self-driving-car

Last synced: 30 Sep 2024

https://github.com/patrick-llgc/learning-deep-learning

Paper reading notes on Deep Learning and Machine Learning

3d-object-detection 3d-object-recognition cnn computer-vision deep-learning literature-review machine-learning medical medical-imaging paper paper-reading paper-review point-cloud reinforcement-learning

Last synced: 30 Sep 2024

https://github.com/aitorzip/DeepGTAV

A plugin for GTAV that transforms it into a vision-based self-driving car research environment.

dataset-generation deep-learning gtav reinforcement-learning self-driving-car

Last synced: 02 Aug 2024

https://github.com/khrylx/pytorch-rl

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo

Last synced: 30 Sep 2024

https://github.com/quantumiracle/popular-rl-algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

reinforcement-learning soft-actor-critic state-of-the-art

Last synced: 02 Oct 2024