Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with reinforcement-learning
A curated list of projects in awesome lists tagged with reinforcement-learning .
https://github.com/Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
algorithms bioinformatics computational-biology computational-physics computer-architecture computer-science computer-vision database-systems databases deep-learning embedded-systems machine-learning quantum-computing reinforcement-learning robotics security systems web-development
Last synced: 25 Oct 2024
https://github.com/developer-y/cs-video-courses
List of Computer Science courses with video lectures.
algorithms bioinformatics computational-biology computational-physics computer-architecture computer-science computer-vision database-systems databases deep-learning embedded-systems machine-learning quantum-computing reinforcement-learning robotics security systems web-development
Last synced: 24 Nov 2024
https://github.com/labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
attention deep-learning deep-learning-tutorial gan literate-programming machine-learning neural-networks optimizers pytorch reinforcement-learning transformer transformers
Last synced: 16 Dec 2024
https://github.com/ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
automl data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search java llm-serving machine-learning model-selection optimization parallel python pytorch ray reinforcement-learning rllib serving tensorflow
Last synced: 16 Dec 2024
https://github.com/eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
applied-data-science applied-machine-learning computer-vision data-discovery data-engineering data-quality data-science deep-learning machine-learning natural-language-processing production recsys reinforcement-learning search
Last synced: 23 Nov 2024
https://github.com/d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
book computer-vision data-science deep-learning gaussian-processes hyperparameter-optimization jax kaggle keras machine-learning mxnet natural-language-processing notebook python pytorch recommender-system reinforcement-learning tensorflow
Last synced: 16 Dec 2024
https://github.com/unity-technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
deep-learning deep-reinforcement-learning machine-learning neural-networks reinforcement-learning unity unity3d
Last synced: 16 Dec 2024
https://github.com/Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
deep-learning deep-reinforcement-learning machine-learning neural-networks reinforcement-learning unity unity3d
Last synced: 27 Oct 2024
https://github.com/ddbourgin/numpy-ml
Machine learning, in numpy
attention bayesian-inference gaussian-mixture-models gaussian-processes good-turing-smoothing gradient-boosting hidden-markov-models knn lstm machine-learning mfcc neural-networks reinforcement-learning resnet topic-modeling vae wavenet wgan-gp word2vec
Last synced: 16 Dec 2024
https://github.com/tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
deep-learning machine-learning machine-translation reinforcement-learning tpu
Last synced: 29 Sep 2024
https://github.com/ai4finance-foundation/fingpt
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
chatgpt finance fingpt fintech large-language-models machine-learning nlp prompt-engineering pytorch reinforcement-learning robo-advisor sentiment-analysis technical-analysis
Last synced: 16 Dec 2024
https://github.com/datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial
Last synced: 16 Dec 2024
https://github.com/AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
chatgpt finance fingpt fintech large-language-models machine-learning nlp prompt-engineering pytorch reinforcement-learning robo-advisor sentiment-analysis technical-analysis
Last synced: 31 Oct 2024
https://github.com/shangtongzhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
artificial-intelligence reinforcement-learning
Last synced: 16 Dec 2024
https://github.com/ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
artificial-intelligence reinforcement-learning
Last synced: 30 Oct 2024
https://github.com/kmario23/deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
artificial-intelligence-algorithms artificial-neural-networks bayesian-statistics computer-vision deep-learning deep-neural-networks deep-reinforcement-learning explainable-ai geometric-deep-learning graph-neural-networks machine-learning medical-imaging natural-language-processing optimization pattern-recognition probabilistic-graphical-models probability reinforcement-learning speech-recognition visual-recognition
Last synced: 03 Dec 2024
https://github.com/bulletphysics/bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
computer-animation game-development kinematics pybullet reinforcement-learning robotics simulation simulator virtual-reality
Last synced: 16 Dec 2024
https://github.com/aws/amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
aws data-science deep-learning examples inference jupyter-notebook machine-learning mlops reinforcement-learning sagemaker training
Last synced: 16 Dec 2024
https://github.com/datawhalechina/easy-rl
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
a3c ddpg deep-reinforcement-learning double-dqn dqn dueling-dqn easy-rl imitation-learning policy-gradient ppo q-learning reinforcement-learning sarsa td3
Last synced: 16 Dec 2024
https://github.com/hvass-labs/tensorflow-tutorials
TensorFlow Tutorials with YouTube Videos
deep-learning machine-learning neural-network python-notebook reinforcement-learning tensorflow tutorial youtube
Last synced: 18 Dec 2024
https://github.com/wandb/wandb
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
ai collaboration data-science data-versioning deep-learning experiment-track hyperparameter-optimization hyperparameter-search hyperparameter-tuning jax keras machine-learning ml-platform mlops model-versioning pytorch reinforcement-learning reproducibility tensorflow
Last synced: 21 Dec 2024
https://github.com/Hvass-Labs/TensorFlow-Tutorials
TensorFlow Tutorials with YouTube Videos
deep-learning machine-learning neural-network python-notebook reinforcement-learning tensorflow tutorial youtube
Last synced: 26 Oct 2024
https://github.com/DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
baselines gsde gym machine-learning openai python pytorch reinforcement-learning reinforcement-learning-algorithms robotics sb3 sde stable-baselines toolbox
Last synced: 30 Oct 2024
https://github.com/dlr-rm/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
baselines gsde gym machine-learning openai python pytorch reinforcement-learning reinforcement-learning-algorithms robotics sb3 sde stable-baselines toolbox
Last synced: 16 Dec 2024
https://github.com/morvanzhou/reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
a3c actor-critic asynchronous-advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network double-dqn dqn dueling-dqn machine-learning policy-gradient ppo prioritized-replay proximal-policy-optimization q-learning reinforcement-learning sarsa sarsa-lambda tensorflow-tutorials tutorial
Last synced: 17 Dec 2024
https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
a3c actor-critic asynchronous-advantage-actor-critic ddpg deep-deterministic-policy-gradient deep-q-network double-dqn dqn dueling-dqn machine-learning policy-gradient ppo prioritized-replay proximal-policy-optimization q-learning reinforcement-learning sarsa sarsa-lambda tensorflow-tutorials tutorial
Last synced: 01 Nov 2024
https://github.com/lazyprogrammer/machine_learning_examples
A collection of machine learning examples and tutorials.
data-science deep-learning machine-learning natural-language-processing python reinforcement-learning
Last synced: 16 Dec 2024
https://github.com/vowpalwabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
active-learning c-plus-plus contextual-bandits cpp learning-to-search machine-learning online-learning reinforcement-learning
Last synced: 16 Dec 2024
https://github.com/VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
active-learning c-plus-plus contextual-bandits cpp learning-to-search machine-learning online-learning reinforcement-learning
Last synced: 26 Oct 2024
https://github.com/morvanzhou/pytorch-tutorial
Build your neural network easy and fast, 莫烦Python中文教学
autoencoder batch batch-normalization classification cnn dqn dropout gan generative-adversarial-network machine-learning neural-network python pytorch pytorch-tutorial pytorch-tutorials regression reinforcement-learning rnn tutorial
Last synced: 18 Dec 2024
https://github.com/MorvanZhou/PyTorch-Tutorial
Build your neural network easy and fast, 莫烦Python中文教学
autoencoder batch batch-normalization classification cnn dqn dropout gan generative-adversarial-network machine-learning neural-network python pytorch pytorch-tutorial pytorch-tutorials regression reinforcement-learning rnn tutorial
Last synced: 27 Oct 2024
https://github.com/google/trax
Trax — Deep Learning with Clear Code and Speed
deep-learning deep-reinforcement-learning jax machine-learning numpy reinforcement-learning transformer
Last synced: 16 Dec 2024
https://github.com/deepmind/pysc2
StarCraft II Learning Environment
blizzard-api deepmind machine-learning reinforcement-learning starcraft-ii starcraft-ii-replays
Last synced: 08 Dec 2024
https://github.com/google-deepmind/pysc2
StarCraft II Learning Environment
blizzard-api deepmind machine-learning reinforcement-learning starcraft-ii starcraft-ii-replays
Last synced: 17 Dec 2024
https://github.com/lucidrains/palm-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
artificial-intelligence attention-mechanisms deep-learning human-feedback reinforcement-learning transformers
Last synced: 17 Dec 2024
https://github.com/lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
artificial-intelligence attention-mechanisms deep-learning human-feedback reinforcement-learning transformers
Last synced: 31 Oct 2024
https://github.com/tensorlayer/tensorlayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
a3c artificial-intelligence chatbot deep-learning dqn gan google imagenet neural-network object-detection python reinforcement-learning tensorflow tensorflow-tutorial tensorflow-tutorials tensorlayer
Last synced: 17 Dec 2024
https://github.com/tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
a3c artificial-intelligence chatbot deep-learning dqn gan google imagenet neural-network object-detection python reinforcement-learning tensorflow tensorflow-tutorial tensorflow-tutorials tensorlayer
Last synced: 30 Oct 2024
https://github.com/farama-foundation/gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
api gym reinforcement-learning
Last synced: 16 Dec 2024
https://github.com/tensorpack/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
deep-learning machine-learning neural-networks reinforcement-learning tensorflow
Last synced: 17 Dec 2024
https://github.com/ppwwyyxx/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
deep-learning machine-learning neural-networks reinforcement-learning tensorflow
Last synced: 29 Nov 2024
https://github.com/yandexdataschool/practical_rl
A course in reinforcement learning in the wild
course-materials deep-learning deep-reinforcement-learning git-course hacktoberfest keras mooc pytorch pytorch-tutorials reinforcement-learning tensorflow
Last synced: 17 Dec 2024
https://github.com/yandexdataschool/Practical_RL
A course in reinforcement learning in the wild
course-materials deep-learning deep-reinforcement-learning git-course hacktoberfest keras mooc pytorch pytorch-tutorials reinforcement-learning tensorflow
Last synced: 26 Oct 2024
https://github.com/vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
a2c actor-critic advantage-actor-critic ale atari deep-learning deep-reinforcement-learning gym machine-learning phasic-policy-gradient ppo proximal-policy-optimization python pytorch reinforcement-learning wandb
Last synced: 16 Dec 2024
https://github.com/Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
api gym reinforcement-learning
Last synced: 30 Oct 2024
https://github.com/keras-rl/keras-rl
Deep Reinforcement Learning for Keras.
keras machine-learning neural-networks reinforcement-learning tensorflow theano
Last synced: 17 Dec 2024
https://github.com/tju-drl-lab/ai-optimizer
The next generation deep reinforcement learning tookit
deep-learning reinforcement-learning transfer-learning
Last synced: 19 Dec 2024
https://github.com/udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
cross-entropy ddpg deep-reinforcement-learning dqn dynamic-programming hill-climbing ml-agents neural-networks openai-gym openai-gym-solutions ppo pytorch pytorch-rl reinforcement-learning reinforcement-learning-algorithms rl-algorithms
Last synced: 27 Nov 2024
https://github.com/TJU-DRL-LAB/AI-Optimizer
The next generation deep reinforcement learning tookit
deep-learning reinforcement-learning transfer-learning
Last synced: 27 Nov 2024
https://github.com/carperai/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
machine-learning pytorch reinforcement-learning
Last synced: 19 Dec 2024
https://github.com/CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
machine-learning pytorch reinforcement-learning
Last synced: 25 Oct 2024
https://github.com/BinRoot/TensorFlow-Book
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
autoencoder book classification clustering convolutional-neural-networks linear-regression logistic-regression machine-learning regression reinforcement-learning tensorflow
Last synced: 27 Oct 2024
https://github.com/binroot/tensorflow-book
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
autoencoder book classification clustering convolutional-neural-networks linear-regression logistic-regression machine-learning regression reinforcement-learning tensorflow
Last synced: 19 Dec 2024
https://github.com/janhuenermann/neurojs
A JavaScript deep learning and reinforcement learning library.
deep-learning javascript machine-learning neural-network reinforcement-learning self-driving-car
Last synced: 19 Dec 2024
https://github.com/andri27-ts/reinforcement-learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning
Last synced: 20 Dec 2024
https://github.com/deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
cpp games multiagent python reinforcement-learning
Last synced: 14 Dec 2024
https://github.com/google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
cpp games multiagent python reinforcement-learning
Last synced: 17 Dec 2024
https://github.com/andri27-ts/Reinforcement-Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
a2c artificial-intelligence deep-learning deep-reinforcement-learning deepmind dqn evolution-strategies machine-learning policy-gradients ppo qlearning reinforcement-learning
Last synced: 27 Oct 2024
https://github.com/hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
baselines data-science gym machine-learning openai python reinforcement-learning reinforcement-learning-algorithms toolbox
Last synced: 30 Oct 2024
https://github.com/kwai/douzero
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
doudizhu game-ai poker reinforcement-learning
Last synced: 17 Dec 2024
https://github.com/mathfoundationrl/book-mathematical-foundation-of-reinforcement-learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
book courses machine-learning reinforcement-learning tutorials
Last synced: 18 Dec 2024
https://github.com/kwai/DouZero
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
doudizhu game-ai poker reinforcement-learning
Last synced: 02 Nov 2024
https://github.com/MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
book courses machine-learning reinforcement-learning tutorials
Last synced: 28 Nov 2024
https://github.com/huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
deep-learning deep-reinforcement-learning reinforcement-learning reinforcement-learning-excercises
Last synced: 17 Dec 2024
https://github.com/suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
alpha-zero alphago alphago-zero alphazero deep-learning gobang gomoku keras mcts monte-carlo-tree-search neural-network othello pytorch reinforcement-learning self-play tensorflow tf
Last synced: 17 Dec 2024
https://github.com/arxivtimes/arxivtimes
repository to research & share the machine learning articles
arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning
Last synced: 29 Nov 2024
https://github.com/arXivTimes/arXivTimes
repository to research & share the machine learning articles
arxivtimes computer-vision machine-learning natural-language-processing reinforcement-learning
Last synced: 06 Nov 2024
https://github.com/deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
artificial-intelligence deep-learning machine-learning mujoco neural-networks physics-simulation reinforcement-learning
Last synced: 08 Nov 2024
https://github.com/google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
artificial-intelligence deep-learning machine-learning mujoco neural-networks physics-simulation reinforcement-learning
Last synced: 02 Nov 2024
https://github.com/ai4finance-foundation/elegantrl
Massively Parallel Deep Reinforcement Learning. 🔥
a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3
Last synced: 17 Dec 2024
https://github.com/AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
a2c bipedalwalkerhardcore ddpg dqn drl-pytorch efficient gae lightweight model-free-rl multiple-gpu per ppo pytorch reinforcement-learning sac stable td3
Last synced: 03 Nov 2024
https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
a2c acktr actor-critic advantage-actor-critic ale atari continuous-control deep-learning deep-reinforcement-learning hessian kfac kronecker-factored-approximation mujoco natural-gradients ppo proximal-policy-optimization pytorch reinforcement-learning roboschool second-order
Last synced: 19 Dec 2024
https://github.com/polyaxon/polyaxon
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
artificial-intelligence caffe data-science deep-learning hyperparameter-optimization jupyter jupyterlab k8s keras kubernetes machine-learning ml mlops mxnet notebook pipelines pytorch reinforcement-learning tensorflow workflow
Last synced: 16 Dec 2024
https://github.com/google-deepmind/acme
A library of reinforcement learning components and agents
agents reinforcement-learning research
Last synced: 17 Dec 2024
https://github.com/rlcode/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
a3c actor-critic deep-learning deep-q-network deep-reinforcement-learning dqn machine-learning policy-gradient reinforcement-learning
Last synced: 20 Dec 2024
https://github.com/microsoft/tensorwatch
Debugging, monitoring and visualization for Python Machine Learning and Data Science
ai data-science debug debugging debugging-tool deep-learning deeplearning explainable-ai explainable-ml jupyter jupyter-notebook machine-learning machinelearning model-visualization monitoring python reinforcement-learning saliency
Last synced: 19 Dec 2024
https://github.com/junxiaosong/alphazero_gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
alphago alphago-zero alphazero board-game gobang gomoku mcts monte-carlo-tree-search pytorch reinforcement-learning rl self-learning tensorflow
Last synced: 19 Dec 2024
https://github.com/pytorch/elf
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
alpha-zero alphago-zero go reinforcement-learning rl rl-environment
Last synced: 26 Sep 2024
https://github.com/pytorch/ELF
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
alpha-zero alphago-zero go reinforcement-learning rl rl-environment
Last synced: 02 Nov 2024
https://github.com/google-research/football
Check out the new game server:
reinforcement-learning reinforcement-learning-environments
Last synced: 16 Dec 2024
https://github.com/wzhe06/reco-papers
Classic papers and resources on recommendation
deep-learning exploration-exploitation machine-learning recommendation recommender-system reinforcement-learning
Last synced: 20 Dec 2024
https://github.com/junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
alphago alphago-zero alphazero board-game gobang gomoku mcts monte-carlo-tree-search pytorch reinforcement-learning rl self-learning tensorflow
Last synced: 18 Nov 2024
https://github.com/wzhe06/Reco-papers
Classic papers and resources on recommendation
deep-learning exploration-exploitation machine-learning recommendation recommender-system reinforcement-learning
Last synced: 14 Nov 2024
https://github.com/catalyst-team/catalyst
Accelerated deep learning R&D
computer-vision deep-learning distributed-computing image-classification image-processing image-segmentation information-retrieval infrastructure machine-learning metric-learning natural-language-processing object-detection python pytorch recommender-system reinforcement-learning reproducibility research text-classification text-segmentation
Last synced: 17 Dec 2024
https://github.com/tensorforce/tensorforce
Tensorforce: a TensorFlow library for applied reinforcement learning
control deep-reinforcement-learning reinforcement-learning system-control tensorflow tensorflow-library tensorforce
Last synced: 16 Dec 2024
https://github.com/paddlepaddle/parl
A high-performance distributed training framework for Reinforcement Learning
large-scale parallelization reinforcement-learning
Last synced: 17 Dec 2024
https://github.com/PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
large-scale parallelization reinforcement-learning
Last synced: 31 Oct 2024
https://github.com/astorfi/deep-learning-roadmap
:satellite: Organized Resources for Deep Learning Researchers and Developers
deep-learning reinforcement-learning
Last synced: 20 Dec 2024
https://github.com/astorfi/Deep-Learning-Roadmap
:satellite: Organized Resources for Deep Learning Researchers and Developers
deep-learning reinforcement-learning
Last synced: 07 Nov 2024
https://github.com/tirthajyoti/data-science-best-resources
Carefully curated resource links for data science in one place
analytics api artificial-intelligence aws cheatsheet data-science data-wrangling database deep-learning linux machine-learning neural-network online-course python r reinforcement-learning scikit-learn sql statistics visualization
Last synced: 17 Dec 2024
https://github.com/opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac
Last synced: 02 Nov 2024
https://github.com/opendilab/di-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
atari distributed-reinforcement-learning distributed-system drl exploration-exploitation imitation-learning impala inverse-reinforcement-learning minigrid model-based-reinforcement-learning mujoco multiagent-reinforcement-learning offline-rl python pytorch-rl r2d2 reinforcement-learning reinforcement-learning-algorithms self-play smac
Last synced: 15 Dec 2024
https://github.com/tirthajyoti/Data-science-best-resources
Carefully curated resource links for data science in one place
analytics api artificial-intelligence aws cheatsheet data-science data-wrangling database deep-learning linux machine-learning neural-network online-course python r reinforcement-learning scikit-learn sql statistics visualization
Last synced: 07 Nov 2024
https://github.com/seungeunrho/minimalrl
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple
Last synced: 20 Dec 2024
https://github.com/datamllab/rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
ai blackjack card-game card-games deep-reinforcement-learning doudizhu game game-ai game-bot gym-environment mahjong multi-agent openai-gym poker poker-game reinforcement-learning texas uno
Last synced: 17 Dec 2024
https://github.com/easy-tensorflow/easy-tensorflow
Simple and comprehensive tutorials in TensorFlow
convolutional-neural-networks deep-learning machine-learning neural-network object-detection pattern-recognition python recurrent-neural-networks reinforcement-learning tensorflow
Last synced: 18 Dec 2024
https://github.com/seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
a2c a3c acer ddpg deep-learning deep-reinforcement-learning dqn machine-learning policy-gradients ppo pytorch reinforce reinforcement-learning sac simple
Last synced: 04 Nov 2024
https://github.com/openrlhf/openrlhf
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
deepspeed large-language-models raylib reinforcement-learning reinforcement-learning-from-human-feedback transformers vllm
Last synced: 17 Dec 2024
https://github.com/open-spaced-repetition/fsrs4anki
A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm
anki anki-addon deep-learning fsrs intelligent-tutoring-system machine-learning memory optimal-control reinforcement-learning spaced-repetition spaced-repetition-algorithm srs
Last synced: 17 Dec 2024
https://github.com/eugeneyan/ml-surveys
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
computer-vision deep-learning embeddings machine-learning nlp recommender-system reinforcement-learning survey
Last synced: 30 Nov 2024