Projects in Awesome Lists tagged with bandit-algorithms

https://github.com/c-bata/goptuna

A hyperparameter optimization framework, inspired by Optuna.

bandit-algorithms bayesian-optimization blackbox-optimization evolution-strategies

Last synced: 04 Apr 2025

https://github.com/sshkhr/practical_rl

My solutions to the Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow

bandit-algorithms deep-reinforcement-learning evolutionary-algorithms markov-decision-processes monte-carlo-sampling policy-gradient pytorch reinforcement-learning td-learning tensorflow

Last synced: 11 Apr 2025

https://github.com/sshkhr/Practical_RL

My solutions to the Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow

bandit-algorithms deep-reinforcement-learning evolutionary-algorithms markov-decision-processes monte-carlo-sampling policy-gradient pytorch reinforcement-learning td-learning tensorflow

Last synced: 05 May 2025

https://github.com/naereen/kullback-leibler-divergences-and-kl-ucb-indexes

🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes

bandit-algorithms cython divergence kl-ucb kullback-leibler-divergence numba python-library

Last synced: 30 Mar 2025

https://github.com/gjjvdburg/thompsonsampling

Source code for a blog post on Thompson Sampling

bandit-algorithms multi-armed-bandit multiarmed-bandits thompson-sampling

Last synced: 11 Feb 2025

https://github.com/albertopirillo/ola-project-2023

Pricing and advertising strategy for an airline company's e-commerce platform, based on Multi-Armed Bandit (MAB) algorithms and Gaussian Processes. Simulations include non-stationary environments.

bandit-algorithms marketing-automation online-learning reinforcement-learning

Last synced: 22 Apr 2025

https://github.com/naereen/kullbackleibler.jl

💫 Fast Julia implementation of various Kullback-Leibler divergences for 1D parametric distributions. 🏋 Also provides optimized code for kl-UCB indexes

bandit-algorithms divergence julia-package kl-ucb kullback-leibler-divergence

Last synced: 30 Mar 2025