Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/risilab/Learnable_MMF
Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs
graph-neural-networks matrix-factorization multiresolution-analysis reinforcement-learning wavelet-analysis
Last synced: 03 Jul 2024
https://github.com/MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac
Last synced: 03 Jul 2024
https://github.com/cyoon1729/distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
ape-x distributed-reinforcement-learning dqn ray reinforcement-learning zeromq
Last synced: 03 Jul 2024
https://github.com/aidudezzz/deepbots
A wrapper framework for Reinforcement Learning in the Webots robot simulator using Python 3.
openai-gym-environment python reinforcement-learning robotics webots
Last synced: 03 Jul 2024
https://github.com/Div99/XQL
Extreme Q-Learning: Max Entropy RL without Entropy
deep-learning energy-based-model gumbel-distribution offline-rl reinforcement-learning
Last synced: 03 Jul 2024
https://github.com/idreesshaikh/Autonomous-Driving-in-Carla-using-Deep-Reinforcement-Learning
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
autonomous-driving carla-driving-simulator carla-environment carla-simulator ddqn deep-learning deep-learning-algorithms deep-reinforcement-learning openai ppo proximal-policy-optimization pytorch reinforcement-learning self-driving self-driving-car self-driving-car-simulation self-driving-cars
Last synced: 03 Jul 2024
https://github.com/WassimTenachi/PhySO
Physical Symbolic Optimization
deep-learning equation-discovery machine-learning physics python reinforcement-learning symbolic-regression
Last synced: 03 Jul 2024
https://github.com/kengz/openai_lab
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
actor-critic ddpg deep-reinforcement-learning experiment keras openai policy-gradient reinforcement-learning tensorflow
Last synced: 02 Jul 2024
https://github.com/starkblaze01/Artificial-Intelligence-Codes
Collection of Artificial Intelligence Algorithms implemented on various problems
adaptive-smoothing artificial-intelligence-algorithms confidence-intervals decision-tree-classifier epsilon-greedy gaussian-mixture-models genetic-algorithm hidden-markov-model hierarchical-clustering hopfield-network jealous-husband k-means-clustering k-sat menace reinforcement-learning travelling-salesman-problem
Last synced: 02 Jul 2024
https://github.com/OpenQuadruped/spot_mini_mini
Dynamics and Domain Randomized Gait Modulation with Bezier Curves for Sim-to-Real Legged Locomotion.
control environment gait gym-environment locomotion machine-learning open-source openai-gym openai-gym-environments optimization pybullet quadruped quadruped-robot quadruped-robot-gaits quadruped-robots reinforcement-learning ros spot terrain walker
Last synced: 02 Jul 2024
https://github.com/YingtongDou/Nash-Detect
Code for KDD 2020 paper Robust Spammer Detection by Nash Reinforcement Learning
fraud-detection game-theory machine-learning reinforcement-learning security spam-detection
Last synced: 01 Jul 2024
https://github.com/YingtongDou/CARE-GNN
Code for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
datamining deep-learning fraud-detection fraud-prevention graphneuralnetwork machine-learning reinforcement-learning security
Last synced: 01 Jul 2024
https://github.com/aslanides/aixijs
AIXIjs - General Reinforcement Learning in the Browser
artificial-general-intelligence demo javascript machine-learning-algorithms reinforcement-learning
Last synced: 01 Jul 2024
https://github.com/HuantWang/SUPERSONIC
SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expertise.
code-optimization compiler-optimization deep-reinforcement-learning reinforcement-learning
Last synced: 01 Jul 2024
https://github.com/allenai/savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
ai2thor-environment meta-learning reinforcement-learning self-adaptive
Last synced: 01 Jul 2024
https://github.com/eveningdong/DAOSL
Implementation of Domain Adaptation in One-Shot Learning
domain-adaptation one-shot-learning reinforcement-learning slim tensorflow
Last synced: 01 Jul 2024
https://github.com/instadeepai/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
jax python reinforcement-learning research
Last synced: 30 Jun 2024
https://github.com/rlcode/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
a3c actor-critic deep-learning deep-q-network deep-reinforcement-learning dqn machine-learning policy-gradient reinforcement-learning
Last synced: 30 Jun 2024
https://github.com/yandexdataschool/AgentNet
Deep Reinforcement Learning library for humans
binder deep-learning deep-neural-networks framework lasagne opeani-gym qlearning reinforcement-learning theano
Last synced: 30 Jun 2024
https://github.com/LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
exercise-solutions reinforcement-learning self-study solutions
Last synced: 30 Jun 2024
https://github.com/JuliaReinforcementLearning/ReinforcementLearningAnIntroduction.jl
Julia code for the book Reinforcement Learning An Introduction
Last synced: 30 Jun 2024
https://github.com/MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
3d aes artificial-intelligence deep-learning diffusion-models educational gans generative-model implementation-of-research-paper inverse-rendering machine-learning meta-learning nerf neural-radiance-fields papers python pytorch reinforcement-learning research rl
Last synced: 29 Jun 2024
https://github.com/google-research/robopianist
[CoRL '23] Dexterous piano playing with deep reinforcement learning.
bimanual dexterous-manipulation mujoco piano reinforcement-learning shadow-hand
Last synced: 29 Jun 2024
https://github.com/opendilab/PPOxFamily
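PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, straighten out the code logic, and master decision AI in practice)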
course decision-intelligence deep-reinforcement-learning python reinforcement-learning
Last synced: 29 Jun 2024
https://github.com/PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
large-scale parallelization reinforcement-learning
Last synced: 29 Jun 2024
https://github.com/chainer/chainerrl
ChainerRL is a deep reinforcement learning library built on top of Chainer.
actor-critic chainer deep-learning dqn machine-learning python reinforcement-learning
Last synced: 29 Jun 2024
https://github.com/google-deepmind/android_env
RL research on Android devices.
android reinforcement-learning
Last synced: 28 Jun 2024
https://github.com/chauncygu/Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
benchmark reinforcement-learning robotics safe
Last synced: 28 Jun 2024
https://github.com/MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
book courses machine-learning reinforcement-learning tutorials
Last synced: 28 Jun 2024
https://github.com/crazydonkey200/neural-symbolic-machines
Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
deep-learning deep-reinforcement-learning program-synthesis reinforcement-learning rnn semantic-parsing tensorflow
Last synced: 28 Jun 2024
https://github.com/open-spaced-repetition/fsrs4anki
A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm
anki anki-addon deep-learning fsrs intelligent-tutoring-system machine-learning memory optimal-control reinforcement-learning spaced-repetition spaced-repetition-algorithm srs
Last synced: 28 Jun 2024
https://github.com/JuliaML/AtariAlgos.jl
Arcade Learning Environment (ALE) wrapped as a Reinforce.jl environment
atari julialang reinforcement-learning
Last synced: 28 Jun 2024
https://github.com/shakenes/vizdoomgym
OpenAI Gym wrapper for ViZDoom environments
openai-gym openai-gym-environments reinforcement-learning vizdoom
Last synced: 27 Jun 2024
https://github.com/mpSchrader/gym-sokoban
Sokoban environment for OpenAI Gym
environment gym openai python reinforcement-learning sokoban
Last synced: 27 Jun 2024
https://github.com/PaddlePaddle/XWorld
A C++/Python simulator package for reinforcement learning
reinforcement-learning simulator
Last synced: 26 Jun 2024
https://github.com/Microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
reinforcement-learning text-based-adventure text-based-game
Last synced: 26 Jun 2024
https://github.com/theamrzaki/text_summurization_abstractive_methods
Multiple implementations of abstractive text summarization, using Google Colab
abstractive-text-summarization ai artificial-intelligence deep-learning deeplearning encoder-decoder google-colab google-colaboratory machine-learning machinelearning nlp pointer-generator policy-gradient reinforcement-learning rnn seq2seq tensorflow text-summarization word2vec
Last synced: 26 Jun 2024
https://github.com/sherrylixuecheng/awesome-ai4chem
Awesome AI for chemistry papers
ai-for-science chemistry deep-learning machine-learning neural-network reinforcement-learning
Last synced: 26 Jun 2024
https://github.com/siomvas/awesome-federated-reinforcement-learning
Publication catalog for research on Federated RL (FRL).
federated-learning federated-reinforcement-learning reinforcement-learning
Last synced: 25 Jun 2024
https://github.com/opendilab/awesome-end-to-end-autonomous-driving
A curated list of awesome End-to-End Autonomous Driving resources (continually updated)
autonomous-driving imitation-learning paper-list reinforcement-learning
Last synced: 25 Jun 2024
https://github.com/tensorlayer/awesome-tensorlayer
A curated list of dedicated resources and applications
adversarial-learning autoencoder cifar-10 computer-vision convolutional-neural-networks database generative-adversarial-network horovod keras lstm-neural-networks mnist natural-language-processing recurrent-neural-networks reinforcement-learning segmentation tensorflow tensorflow-tutorials tensorlayer tf-slim tflearn
Last synced: 25 Jun 2024
https://github.com/Alro10/meta-learning-resources
A curated list of meta-learning resources: papers, coding, tutorials, etc.
deep-reinforcement-learning meta-learning reinforcement-learning
Last synced: 24 Jun 2024
https://github.com/metarl/awesome-metarl
A curated list of awesome Meta Reinforcement Learning
awesome meta-learning meta-rl reinforcement-learning reinforcement-meta-learning
Last synced: 24 Jun 2024
https://github.com/catalyst-team/alchemy
Experiments logging & visualization
deep-learning experiment-track infrastructure keras machine-learning pytorch reinforcement-learning reproducibility research tensorflow
Last synced: 24 Jun 2024
https://github.com/XinJingHao/Deep-Reinforcement-Learning-Algorithms-with-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
asl c51 categorical-dqn ddpg deep-reinforcement-learning double-dqn dueling-dqn machine-learning noisynet-dqn ppo prioritized-experience-replay pytorch q-learning reinforcement-learning sac td3
Last synced: 23 Jun 2024
https://github.com/kakaoenterprise/JORLDY
Repository for Open Source Reinforcement Learning Framework JORLDY
framework gym mlagents pytorch reinforcement-learning
Last synced: 23 Jun 2024
https://github.com/SunQingYun1996/Graph-Reinforcement-Learning-Papers
A curated list of graph reinforcement learning papers.
academic-publications awsome-list data-mining deep-learning graph-algorithms graph-neural-networks papers reinforcement-learning
Last synced: 23 Jun 2024
https://github.com/Jingliang-Duan/DSAC-v1
DSAC: Distributional Soft Actor-Critic
pytorch reinforcement-learning
Last synced: 23 Jun 2024
https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.
agent artificial-intelligence autonomous-agent awesome-paper-collection large-language-models machine-learning natural-language-processing reinforcement-learning
Last synced: 23 Jun 2024
https://github.com/PaddlePaddle/awesome-DeepLearning
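Introductory, advanced, and specialized deep learning courses, academic case studies, industry practice cases, a deep learning knowledge encyclopedia, and an interview question bank. Courses, cases, and knowledge of Deep Learning and AI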
bert classification cnn detection dqn dssm dynabert gan nlp pose recommender-system reinforcement-learning rnn sarsa segmentation tinybert transformer video
Last synced: 22 Jun 2024
https://github.com/AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3
Last synced: 22 Jun 2024
https://github.com/AminHP/gym-anytrading
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
dqn forex gym-environments openai-gym q-learning reinforcement-learning stocks trading trading-algorithms trading-environments
Last synced: 22 Jun 2024
https://github.com/hubbs5/or-gym
Environments for OR and RL Research
deep-reinforcement-learning operations-research optimization reinforcement-learning supply-chain supply-chain-management vehicle-routing-problem
Last synced: 22 Jun 2024
https://github.com/offchan42/machine-learning-curriculum
:computer: Learn to make machines learn so that you don't have to struggle to program them; The ultimate list
chainer convolutional-neural-networks course curriculum deep-learning guide machine-learning mlops-workflow mxnet neural-network python pytorch recurrent-neural-networks reinforcement-learning tensorflow
Last synced: 20 Jun 2024
https://github.com/jayinai/data-science-question-answer
A repo for data science related questions and answers
data-science deep-learning machine-learning reinforcement-learning sql statistics system
Last synced: 20 Jun 2024
https://github.com/theainerd/MLInterview
:octocat: A curated awesome list of AI Startups in India & Machine Learning Interview Guide. Feel free to contribute!
ai-careers awesome-list data-science data-science-interview deep-learning interview machine-learning machine-learning-interview natural-language-processing reinforcement-learning sql statistics unsupervised-machine-learning
Last synced: 20 Jun 2024
https://github.com/murufeng/Awesome-AI-algorithm
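A comprehensive collection of resources on artificial intelligence algorithms, including materials and code for job interviews, machine learning, deep learning, reinforcement learning, and more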
algorithms artificial-intelligence deep-learning interview machine-learning reinforcement-learning
Last synced: 20 Jun 2024
https://github.com/wagamamaz/tensorlayer-tricks
How to use TensorLayer
computer-vision data-science deep-learning keras lasagne machine-learning natural-language-processing neural-network neural-networks nlp reinforcement-learning tensorboard tensorflow tensorflow-experiments tensorflow-framework tensorflow-library tensorflow-models tensorflow-tutorials tensorlayer tflearn
Last synced: 20 Jun 2024
https://github.com/awesome-mlss/awesome-mlss
🤖 Machine Learning Summer School deadlines
artificial-intelligence deep-learning machine-learning reinforcement-learning summer-school summer-schools
Last synced: 20 Jun 2024
https://github.com/eugeneyan/ml-surveys
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.
computer-vision deep-learning embeddings machine-learning nlp recommender-system reinforcement-learning survey
Last synced: 20 Jun 2024
https://github.com/isayev/ReLeaSE
Deep Reinforcement Learning for de-novo Drug Design
cheminformatics deeplearning drug-discovery molecular-modeling qsar reinforcement-learning
Last synced: 20 Jun 2024
https://github.com/quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
reinforcement-learning soft-actor-critic state-of-the-art
Last synced: 20 Jun 2024
https://github.com/IDSIA/modern-srwm
Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep RL Workshop) and "Accelerating Neural Self-Improvement via Bootstrapping" (ICLR 2023 Workshop)
fast-weight-programmers fast-weights pytorch reinforcement-learning rnn-based-few-shot-learning self-referential-learning self-referential-weight-matrix torchbeast transformers
Last synced: 19 Jun 2024
https://github.com/ChenDRAG/SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
diffusion expressive generative offline reinforcement-learning score-based
Last synced: 19 Jun 2024
https://github.com/ChenDRAG/CEP-energy-guided-diffusion
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)
cep diffusion energy guided offline qgpo reinforcement-learning sampling
Last synced: 19 Jun 2024
https://github.com/hi-abhi/tensorflow-value-iteration-networks
TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper
deep-learning machine-learning neural-networks reinforcement-learning tensorflow
Last synced: 19 Jun 2024
https://github.com/Curt-Park/rainbow-is-all-you-need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
colab-notebook dqn gym-environment nbviewer pytorch rainbow reinforcement-learning
Last synced: 18 Jun 2024
https://github.com/atonkamanda/awesome-ai-brain-computer-interface
Must-read papers on machine learning, deep learning, reinforcement learning and other learning methods for brain-computer interfaces.
awesome awesome-list awesome-lists awesome-readme bci bci-systems brain-computer-interface deep-learning deep-neural-networks deep-reinforcement-learning machine-learning neural neural-interfaces reinforcement-learning robotics
Last synced: 18 Jun 2024
https://github.com/hades217/awesome-ai
A curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)
artificial-intelligence chatbot data-science deep-learning machine-learning neural-network reinforcement-learning voice-assistant
Last synced: 18 Jun 2024
https://github.com/awarebayes/RecNN
Reinforced Recommendation toolkit built around pytorch 1.7
library news-recommendation pytorch pytorch-rl recommendation-system recommender-system reinforcement-learning toolkit
Last synced: 18 Jun 2024
https://github.com/awjuliani/neuro-nav
A library for neuroscience-inspired navigation and decision making research.
cognitive-science deep-reinforcement-learning gym-environment machine-learning reinforcement-learning
Last synced: 18 Jun 2024
https://github.com/jxx123/simglucose
A Type-1 Diabetes simulator implemented in Python for reinforcement learning purposes
artificial-pancreas diabetes glucose-monitoring openai-gym python reinforcement-learning rllab simulation simulator simulator-controls
Last synced: 17 Jun 2024
https://github.com/Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
a2c deep-reinforcement-learning fisher-vectors generative-adversarial-network policy-gradient ppo proximal-policy-optimization pytorch pytorch-rl reinforcement-learning trpo
Last synced: 16 Jun 2024
https://github.com/nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-learning deep-reinforcement-learning policy-gradient ppo ppo-pytorch proximal-policy-optimization pytorch pytorch-implmention pytorch-tutorial reinforcement-learning reinforcement-learning-algorithms
Last synced: 16 Jun 2024
https://github.com/cahya-wirawan/deepracer-tools
Some tools for AWS Deepracer
aws deepracer machine-learning reinforcement-learning
Last synced: 16 Jun 2024
https://github.com/DoguD/aws-deepracer
Reward function experiments for AWS DeepRacer.
aws deepracer reinforcement-learning
Last synced: 16 Jun 2024
https://github.com/arXivTimes/arXivTimes
Repository to research and share machine learning articles
arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning
Last synced: 16 Jun 2024
https://github.com/imsheridan/DeepRec
A collection of classic and cutting-edge industry papers and resources on recommendation and advertising / Must-read Papers on Recommendation System and CTR Prediction
computational-advertising deep-learning exploration-exploitation recommendation recommendation-system reinforcement-learning
Last synced: 16 Jun 2024
https://zhengyiluo.github.io/PULSE/
Official Implementation of the ICLR 2024 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control
control humanoid isaac-gym reinforcement-learning simulation
Last synced: 16 Jun 2024
https://github.com/byminji/SLTtrack
Official Implementation of Towards Sequence-Level Training for Visual Tracking (ECCV 2022)
eccv2022 pytorch reinforcement-learning tracking visual-tracking vot
Last synced: 16 Jun 2024
https://github.com/jiachenli94/Awesome-Decision-Making-Reinforcement-Learning
A selection of state-of-the-art research materials on decision making and motion planning.
algorithms artificial-intelligence autonomous-vehicles causal-inference decision-making deep-learning intelligent-transportation-systems inverse-reinforcement-learning machine-learning motion-control motion-planning motion-prediction multi-agent-learning multiagent-reinforcement-learning reinforcement-learning robotics trajectory-generation trajectory-planning trajectory-prediction
Last synced: 15 Jun 2024
https://github.com/jianguoz/Conversational-AI
Conversational AI Reading Materials
conversational-ai dialogue-systems generative-adversarial-network natural-language-processing papers question-answering reinforcement-learning visual-dialog
Last synced: 15 Jun 2024
https://github.com/tsenghungchen/dialog-generation-paper
A list of recent papers regarding dialogue generation
dialogue dialogue-generation dialogue-systems reinforcement-learning
Last synced: 15 Jun 2024
https://github.com/wagamamaz/tensorflow-tutorial
TensorFlow and Deep Learning Tutorials
autoencoder cnn convolutional-neural-networks deep-learning deep-learning-tutorial deep-reinforcement-learning keras machine-learning multi-layer-perceptron neural-machine-translation neural-network neural-networks nlp notebook recurrent-neural-networks reinforcement-learning tensorflow tensorflow-tutorials tensorlayer tflearn
Last synced: 15 Jun 2024
https://github.com/Islandman93/reinforcepy
Collection of reinforcement learners implemented in Python, mainly DQN and its variants
Last synced: 15 Jun 2024
https://github.com/steveKapturowski/tensorflow-rl
Implementations of deep RL papers and random experimentation
a3c dqn openai-gym pgq reinforcement-learning tensorflow trpo
Last synced: 15 Jun 2024
https://github.com/awjuliani/DeepRL-Agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
reinforcement-learning tensorflow
Last synced: 15 Jun 2024
https://github.com/hzwer/NIPS2017-LearningToRunACE
2nd place solution of NIPS2017 LearningToRun Competition.
deep-learning keras reinforcement-learning
Last synced: 14 Jun 2024
https://github.com/aurimas13/machine-learning-goodness
The Machine Learning project including ML/DL projects, notebooks, cheat codes of ML/DL, useful information on AI/AGI and codes or snippets/scripts/tasks with tips.
algorithms artifcial-intelligence artificial-intelligence chatgpt cheatsheets computer-science data-science deep-neural-networks deep-reinforcement-learning gpt4 machine-learning machine-learning-algorithms mlops python python3 reinforcement-learning reinforcement-learning-algorithms tips tips-and-tricks
Last synced: 14 Jun 2024
https://github.com/PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ai-safety alpaca beaver datasets deepspeed gpt large-language-models llama llm llms reinforcement-learning reinforcement-learning-from-human-feedback rlhf safe-reinforcement-learning safe-reinforcement-learning-from-human-feedback safe-rlhf safety transformer transformers vicuna
Last synced: 14 Jun 2024
https://github.com/OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm
Last synced: 14 Jun 2024
https://github.com/samuki/reinforce-joey
This is a fork of the awesome Joey-NMT with Reinforcement Learning algorithms like Policy Gradient, MRT and Advantage Actor Critic.
neural-machine-translation reinforcement-learning
Last synced: 14 Jun 2024
https://github.com/tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
ai-alignment ai-safety decision-transformers gpt language-models pretraining reinforcement-learning rlhf
Last synced: 14 Jun 2024
https://github.com/WoodOxen/tactics2d
Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making
autonomous-driving gym-environment python3 reinforcement-learning simulator
Last synced: 13 Jun 2024
https://github.com/diditforlulz273/PokerRL-Omaha
Omaha Poker functionality plus some features for the PokerRL reinforcement learning card framework
cfr counterfactual-regret-minimization deep-learning monte-carlo-tree-search omaha-poker poker-bot pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 13 Jun 2024
https://github.com/uoftcprg/pokerkit
[IEEE ToG] An open-source Python library for poker simulations and hand evaluations
artificial-intelligence deep-learning game game-development game-theory holdem-poker imperfect-information-game libratus pluribus poker poker-ai poker-engine poker-evaluator poker-game poker-hands poker-library poker-strategies python reinforcement-learning texas-holdem
Last synced: 13 Jun 2024
https://github.com/EricSteinberger/PokerRL
Framework for Multi-Agent Deep Reinforcement Learning in Poker
deep-learning framework gym-environment poker ray reinforcement-learning reinforcement-learning-algorithms research
Last synced: 13 Jun 2024
https://github.com/liziniu/ReMax
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
large-language-models policy-gradient reinforcement-learning rlhf
Last synced: 13 Jun 2024
https://github.com/SherbyRobotics/pyro
An object-based toolbox for robot dynamic simulation, analysis, control and planning.
analysis closed-loop-control control-systems dynamic-programming dynamics optimal optimal-control planning-algorithms python reinforcement-learning robotics
Last synced: 13 Jun 2024