Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with reinforcement-learning
A curated list of projects in awesome lists tagged with reinforcement-learning .
https://github.com/MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
book courses machine-learning reinforcement-learning tutorials
Last synced: 08 Aug 2024
https://github.com/farama-foundation/pettingzoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
api gym gymnasium multi-agent-reinforcement-learning multiagent-reinforcement-learning reinforcement-learning
Last synced: 29 Sep 2024
https://github.com/opendilab/DI-engine
OpenDILab Decision AI Engine
atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac
Last synced: 01 Aug 2024
https://github.com/open-spaced-repetition/fsrs4anki
A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm
anki anki-addon deep-learning fsrs intelligent-tutoring-system machine-learning memory optimal-control reinforcement-learning spaced-repetition spaced-repetition-algorithm srs
Last synced: 30 Sep 2024
https://github.com/ashishpatel26/andrew-ng-notes
This is Andrew NG Coursera Handwritten Notes.
andrew-ng andrew-ng-course andrew-ng-machine-learning andrewng coursera coursera-machine-learning data-science deep-learning deep-neural-networks dl machine-learning ml neural-network neural-networks numpy pandas python pytorch reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
api gym gymnasium multi-agent-reinforcement-learning multiagent-reinforcement-learning reinforcement-learning
Last synced: 01 Aug 2024
https://github.com/jayinai/data-science-question-answer
A repo for data science related questions and answers
data-science deep-learning machine-learning reinforcement-learning sql statistics system
Last synced: 30 Sep 2024
https://github.com/werner-duvaud/muzero-general
MuZero
alphago alphazero deep-learning deep-reinforcement-learning gym machine-learning mcts model-based-rl monte-carlo-tree-search muzero muzero-general neural-network python3 pytorch reinforcement-learning residual-network rl self-learning tensorboard
Last synced: 01 Oct 2024
https://github.com/eleurent/highway-env
A minimalist environment for decision-making in autonomous driving
autonomous-driving gym-environment reinforcement-learning
Last synced: 13 Aug 2024
https://github.com/Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
autonomous-driving gym-environment reinforcement-learning
Last synced: 31 Jul 2024
https://github.com/farama-foundation/highwayenv
A minimalist environment for decision-making in autonomous driving
autonomous-driving gym-environment reinforcement-learning
Last synced: 01 Oct 2024
https://github.com/IntelLabs/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
carla coach deep-learning distributed-reinforcement-learning hierarchical-reinforcement-learning imitation-learning mujoco mxnet onnx openai-gym reinforcement-learning rl roboschool starcraft starcraft2 starcraft2-ai tensorflow
Last synced: 31 Jul 2024
https://github.com/intellabs/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
carla coach deep-learning distributed-reinforcement-learning hierarchical-reinforcement-learning imitation-learning mujoco mxnet onnx openai-gym reinforcement-learning rl roboschool starcraft starcraft2 starcraft2-ai tensorflow
Last synced: 27 Sep 2024
https://github.com/ashishpatel26/Andrew-NG-Notes
This is Andrew NG Coursera Handwritten Notes.
andrew-ng andrew-ng-course andrew-ng-machine-learning andrewng coursera coursera-machine-learning data-science deep-learning deep-neural-networks dl machine-learning ml neural-network neural-networks numpy pandas python pytorch reinforcement-learning
Last synced: 03 Aug 2024
https://github.com/lgsvl/simulator
A ROS/ROS2 Multi-robot Simulator for Autonomous Vehicles
3d airsim api artificial-intelligence autonomous autoware baidu carla computer-vision csharp deep-learning game-engine machine-learning reinforcement-learning ros self-driving-car simulator tensorflow unity unreal-engine
Last synced: 29 Sep 2024
https://github.com/hzwer/iccv2019-learningtopaint
ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
computer-vision deep-learning painting pytorch reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/awjuliani/deeprl-agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
reinforcement-learning tensorflow
Last synced: 30 Sep 2024
https://github.com/hzwer/ICCV2019-LearningToPaint
ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning
computer-vision deep-learning painting pytorch reinforcement-learning
Last synced: 06 Aug 2024
https://github.com/awjuliani/DeepRL-Agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
reinforcement-learning tensorflow
Last synced: 30 Jul 2024
https://github.com/google-deepmind/mctx
Monte Carlo tree search in JAX
jax monte-carlo-tree-search reinforcement-learning
Last synced: 31 Jul 2024
https://github.com/deepmind/mctx
Monte Carlo tree search in JAX
jax monte-carlo-tree-search reinforcement-learning
Last synced: 04 Aug 2024
https://github.com/tirthajyoti/Papers-Literature-ML-DL-RL-AI
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
artificial-intelligence data-mining data-science deep-learning game-theory hardware learning-theory literature machine-learning machine-learning-algorithms neural-network paper pattern-recognition reinforcement-learning silicon statistical-learning statistics
Last synced: 31 Jul 2024
https://github.com/allenai/rl4lms
A modular RL library to fine-tune language models to human preferences
dialogue-generation language-modeling machine-translation natural-language-processing nlp reinforcement-learning summarization table-to-text text-generation
Last synced: 01 Oct 2024
https://github.com/zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
alphago-zero chess keras reinforcement-learning tensorflow
Last synced: 26 Sep 2024
https://github.com/harderthenharder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
information-extraction nlp reinforcement-learning text-classification text-generation text-matching transformers
Last synced: 30 Sep 2024
https://github.com/aminhp/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments
Last synced: 30 Sep 2024
https://github.com/allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
dialogue-generation language-modeling machine-translation natural-language-processing nlp reinforcement-learning summarization table-to-text text-generation
Last synced: 31 Jul 2024
https://github.com/openrlhf/openrlhf
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm
Last synced: 30 Sep 2024
https://github.com/Zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
alphago-zero chess keras reinforcement-learning tensorflow
Last synced: 30 Jul 2024
https://github.com/facebookresearch/elf
An End-To-End, Lightweight and Flexible Platform for Game Research
artificial-intelligence cpp deep-learning gaming neural-network platform python reinforcement-learning
Last synced: 25 Sep 2024
https://github.com/facebookresearch/ELF
An End-To-End, Lightweight and Flexible Platform for Game Research
artificial-intelligence cpp deep-learning gaming neural-network platform python reinforcement-learning
Last synced: 31 Jul 2024
https://github.com/google/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
jax physics-simulation reinforcement-learning robotics
Last synced: 30 Sep 2024
https://github.com/girafe-ai/ml-course
Open Machine Learning course
computer-vision course deep-learning machine-learning materials natural-language-processing python pytorch reinforcement-learning seminars
Last synced: 30 Sep 2024
https://github.com/AminHP/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments
Last synced: 02 Aug 2024
https://github.com/dlr-rm/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
deep-reinforcement-learning gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning lab openai optimization pybullet pybullet-environments pytorch reinforcement-learning rl robotics sde stable-baselines tuning-hyperparameters
Last synced: 30 Sep 2024
https://github.com/lywangpx/reinforcement-learning-2nd-edition-by-sutton-exercise-solutions
Solutions of Reinforcement Learning, An Introduction
exercise-solutions reinforcement-learning self-study solutions
Last synced: 30 Sep 2024
https://github.com/letianzj/quantresearch
Quantitative analysis, strategies and backtests
algorithmic-trading algotrading asset-allocation asset-management backtesting-trading-strategies backtests data-science deep-learning derivatives-pricing financial-analysis machine-learning pairs-trading portfolio-management quantitative-finance quantitative-trading reinforcement-learning risk-management statistical-arbitrage trading-algorithms trading-strategies
Last synced: 30 Sep 2024
https://github.com/LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
exercise-solutions reinforcement-learning self-study solutions
Last synced: 01 Aug 2024
https://github.com/DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
deep-reinforcement-learning gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning lab openai optimization pybullet pybullet-environments pytorch reinforcement-learning rl robotics sde stable-baselines tuning-hyperparameters
Last synced: 02 Aug 2024
https://github.com/opendilab/ppoxfamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
course decision-intelligence deep-reinforcement-learning python reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/alessiodm/drl-zh
Deep Reinforcement Learning: Zero to Hero!
deep-learning deep-reinforcement-learning machine-learning reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/facebookresearch/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
ai computer-vision deep-learning deep-reinforcement-learning python reinforcement-learning research robotics sim2real simulator
Last synced: 25 Sep 2024
https://github.com/OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm
Last synced: 01 Aug 2024
https://github.com/curt-park/rainbow-is-all-you-need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
colab-notebook dqn gym-environment nbviewer pytorch rainbow reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
ai control decision-making distributed-computing machine-learning marl model-based-reinforcement-learning multi-agent-reinforcement-learning pytorch reinforcement-learning rl robotics torch
Last synced: 29 Sep 2024
https://github.com/packtpublishing/advanced-deep-learning-with-keras
Advanced Deep Learning with Keras, published by Packt
autoencoder deep-learning gan keras reinforcement-learning vae
Last synced: 26 Sep 2024
https://github.com/wassimtenachi/physo
Physical Symbolic Optimization
deep-learning equation-discovery machine-learning physics python reinforcement-learning symbolic-regression
Last synced: 30 Sep 2024
https://github.com/Curt-Park/rainbow-is-all-you-need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
colab-notebook dqn gym-environment nbviewer pytorch rainbow reinforcement-learning
Last synced: 31 Jul 2024
https://github.com/WassimTenachi/PhySO
Physical Symbolic Optimization
deep-learning equation-discovery machine-learning physics python reinforcement-learning symbolic-regression
Last synced: 31 Jul 2024
https://github.com/yunyang1994/tensorflow2.0-examples
🙄 Difficult algorithm, Simple code.
convolutional-neural-network dcgan-tensorflow deep-learning deep-neural-networks fcn8s gan image-classification linear-regression machine-learning object-detection pix2pix reinforcement-learning resnet tensorflow tensorflow-examples tensorflow2 unet-image-segmentation vgg16 yolov3
Last synced: 26 Sep 2024
https://github.com/YunYang1994/TensorFlow2.0-Examples
🙄 Difficult algorithm, Simple code.
convolutional-neural-network dcgan-tensorflow deep-learning deep-neural-networks fcn8s gan image-classification linear-regression machine-learning object-detection pix2pix reinforcement-learning resnet tensorflow tensorflow-examples tensorflow2 unet-image-segmentation vgg16 yolov3
Last synced: 31 Jul 2024
https://github.com/geek-ai/magent
A Platform for Many-Agent Reinforcement Learning
deep-learning multi-agent reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/Farama-Foundation/ViZDoom
Reinforcement Learning environments based on the 1993 game Doom :godmode:
deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom
Last synced: 04 Aug 2024
https://github.com/geek-ai/MAgent
A Platform for Many-Agent Reinforcement Learning
deep-learning multi-agent reinforcement-learning
Last synced: 30 Jul 2024
https://github.com/farama-foundation/vizdoom
Reinforcement Learning environments based on the 1993 game Doom :godmode:
deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom
Last synced: 30 Sep 2024
https://github.com/mwydmuch/ViZDoom
Reinforcement Learning environments based on the 1993 game Doom :godmode:
deep-learning doom examples gym-environment gymnasium python reinforcement-learning vizdoom
Last synced: 01 Aug 2024
https://github.com/letianzj/QuantResearch
Quantitative analysis, strategies and backtests
algorithmic-trading algotrading asset-allocation asset-management backtesting-trading-strategies backtests data-science deep-learning derivatives-pricing financial-analysis machine-learning pairs-trading portfolio-management quantitative-finance quantitative-trading reinforcement-learning risk-management statistical-arbitrage trading-algorithms trading-strategies
Last synced: 01 Aug 2024
https://github.com/pytorch/tnt
A lightweight library for PyTorch training tools and utilities
deep-learning machine-learning neural-network python pytorch reinforcement-learning
Last synced: 29 Sep 2024
https://github.com/nikhilbarhate99/ppo-pytorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Sep 2024
https://github.com/chuyangliu/snake
Artificial intelligence for the Snake game.
ai algorithm artificial-intelligence deep-reinforcement-learning game graph-theory python reinforcement-learning snake snake-ai
Last synced: 30 Sep 2024
https://github.com/uber-research/deep-neuroevolution
Deep Neuroevolution
ai deep-neuroevolution machine-learning reinforcement-learning
Last synced: 01 Aug 2024
https://github.com/yvictor/tradinggym
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.
backtest backtesting-trading-strategies python reinforcement-learning trading trading-api trading-bot trading-platform trading-simulator trading-strategies
Last synced: 30 Sep 2024
https://github.com/nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms
Last synced: 02 Aug 2024
https://github.com/BindsNET/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse
Last synced: 02 Aug 2024
https://github.com/bindsnet/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse
Last synced: 25 Sep 2024
https://github.com/Yvictor/TradingGym
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.
backtest backtesting-trading-strategies python reinforcement-learning trading trading-api trading-bot trading-platform trading-simulator trading-strategies
Last synced: 31 Jul 2024
https://github.com/TorchCraft/TorchCraft
Connecting Torch to StarCraft
bwapi deep-learning machine-learning reinforcement-learning starcraft torch torchcraft
Last synced: 30 Jul 2024
https://github.com/trademaster-ntu/trademaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:
finance fintech investment-strategies jupyter-notebook machine-learning python pytorch quantitative-trading reinforcement-learning stock-market trading-platform
Last synced: 26 Sep 2024
https://github.com/omarsar/nlp_overview
Overview of Modern Deep Learning Techniques Applied to Natural Language Processing
cnn deep-learning nlp reinforcement-learning rnn word-embeddings
Last synced: 30 Sep 2024
https://github.com/mossr/beautifulalgorithms.jl
Concise and beautiful algorithms written in Julia
algorithms decision-making-under-uncertainty julia machine-learning neural-network optimization quine regression reinforcement-learning sorting
Last synced: 30 Sep 2024
https://github.com/Ceruleanacg/Personae
📈 Personae is a repo of implements and environment of Deep Reinforcement Learning & Supervised Learning for Quantitative Trading.
paper reinforcement-learning stock stock-data stock-price-prediction supervised-learning time-series-prediction trading
Last synced: 31 Jul 2024
https://github.com/mossr/BeautifulAlgorithms.jl
Concise and beautiful algorithms written in Julia
algorithms decision-making-under-uncertainty julia machine-learning neural-network optimization quine regression reinforcement-learning sorting
Last synced: 31 Jul 2024
https://github.com/danijar/dreamerv3
Mastering Diverse Domains through World Models
artificial-intelligence general jax minecraft reinforcement-learning world-models
Last synced: 30 Sep 2024
https://github.com/keon/deep-q-learning
Minimal Deep Q Learning (DQN & DDQN) implementations in Keras
ddqn deep-learning deep-q-network deep-reinforcement-learning dqn reinforcement-learning
Last synced: 26 Sep 2024
https://github.com/charlesXu86/Chatbot_CN
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
attention-mechanism chatbot-cn deep-learning dialogue-systems django-restful intent-detection ir knowledge-graph ner nlg nlu oriented-dialogs recommendation reinforcement-learning sentiment-analysis slot-filling tenserflow-serving tensorflow text-classification text-correct
Last synced: 01 Aug 2024
https://github.com/arise-initiative/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
physics-simulation reinforcement-learning robot-learning robot-manipulation robotics
Last synced: 01 Oct 2024
https://github.com/kengz/slm-lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac
Last synced: 25 Sep 2024
https://github.com/TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:
finance fintech investment-strategies jupyter-notebook machine-learning python pytorch quantitative-trading reinforcement-learning stock-market trading-platform
Last synced: 01 Aug 2024
https://github.com/ikatsov/tensor-house
A collection of reference Jupyter notebooks and demo AI/ML applications for enterprise use cases: marketing, pricing, supply chain, smart manufacturing, and more.
ai customer-analysis data-science deep-learning llm machine-learning marketing models personalization reinforcement-learning supply-chain
Last synced: 30 Sep 2024
https://github.com/kengz/SLM-Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
a2c a3c benchmark deep-reinforcement-learning dqn policy-gradient ppo pytorch reinforcement-learning sac
Last synced: 01 Aug 2024
https://github.com/microsoft/textworld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
reinforcement-learning text-based-adventure text-based-game
Last synced: 30 Sep 2024
https://github.com/rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
deep-learning deep-neural-networks deep-reinforcement-learning machine-learning reinforcement-learning soft-actor-critic
Last synced: 30 Sep 2024
https://github.com/morvanzhou/evolutionary-algorithm
Evolutionary Algorithm using Python, 莫烦Python 中文AI教学
distributed-es es evolution-strategies evolution-strategy evolutionary-algorithm genetic-algorithm machine-learning microbial-ga microbial-genetic-algorithm neat nes neural-nets neural-network neuroevolution openai python reinforcement-learning travel-sale-problem travel-sales-problem tutorial
Last synced: 30 Sep 2024
https://github.com/Microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
reinforcement-learning text-based-adventure text-based-game
Last synced: 03 Aug 2024
https://github.com/microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
reinforcement-learning text-based-adventure text-based-game
Last synced: 01 Aug 2024
https://github.com/chainer/chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
actor-critic chainer deep-learning dqn machine-learning python reinforcement-learning
Last synced: 25 Sep 2024
https://github.com/sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow
deep-meta-learning few-shot-learning keras maml mann matching-networks meta-imitation-learning meta-sgd metalearning ntm one-shot-learning prototypical-network prototypical-networks reinforcement-learning relation-network reptile shot-learning siamese-network tensorflow zero-shot-learning
Last synced: 02 Aug 2024
https://github.com/pku-alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 27 Sep 2024
https://github.com/PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 03 Aug 2024
https://github.com/utiasDSL/gym-pybullet-drones
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
betaflight control crazyflie gym gymnasium multi-agent pybullet quadcopter quadrotor reinforcement-learning robotics sitl stable-baselines3 uav
Last synced: 31 Jul 2024
https://github.com/araffin/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
gym hyperparameter-optimization hyperparameter-search hyperparameter-tuning hyperparameters openai openai-gym optimization pybullet reinforcement-learning rl stable-baselines zoo
Last synced: 03 Oct 2024
https://github.com/aitorzip/deepgtav
A plugin for GTAV that transforms it into a vision-based self-driving car research environment.
dataset-generation deep-learning gtav reinforcement-learning self-driving-car
Last synced: 30 Sep 2024
https://github.com/patrick-llgc/learning-deep-learning
Paper reading notes on Deep Learning and Machine Learning
3d-object-detection 3d-object-recognition cnn computer-vision deep-learning literature-review machine-learning medical medical-imaging paper paper-reading paper-review point-cloud reinforcement-learning
Last synced: 30 Sep 2024
https://github.com/aitorzip/DeepGTAV
A plugin for GTAV that transforms it into a vision-based self-driving car research environment.
dataset-generation deep-learning gtav reinforcement-learning self-driving-car
Last synced: 02 Aug 2024
https://github.com/khrylx/pytorch-rl
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo
Last synced: 30 Sep 2024
https://github.com/quantumiracle/popular-rl-algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
reinforcement-learning soft-actor-critic state-of-the-art
Last synced: 02 Oct 2024
https://github.com/offchan42/machine-learning-curriculum
:computer: Learn to make machines learn so that you don't have to struggle to program them; The ultimate list
chainer convolutional-neural-networks course curriculum deep-learning guide machine-learning mlops-workflow mxnet neural-network python pytorch recurrent-neural-networks reinforcement-learning tensorflow
Last synced: 30 Sep 2024
https://github.com/uvipen/super-mario-bros-ppo-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
ai deep-learning gym mario openai openai-gym ppo ppo2 proximal-policy-optimization python python3 pytorch reinforcement-learning super-mario-bros
Last synced: 30 Sep 2024
https://github.com/oxwhirl/smac
SMAC: The StarCraft Multi-Agent Challenge
benchmark machine-learning multiagent-systems reinforcement-learning starcraft-ii
Last synced: 30 Sep 2024
https://github.com/neymarl/chinesechess-alphazero
Implement AlphaZero/AlphaGo Zero methods on Chinese chess.
alphazero chinese-chess deep-learning reinforcement-learning
Last synced: 30 Sep 2024