Projects in Awesome Lists tagged with bandit-algorithms
A curated list of projects in awesome lists tagged with bandit-algorithms .
https://github.com/c-bata/goptuna
A hyperparameter optimization framework, inspired by Optuna.
bandit-algorithms bayesian-optimization blackbox-optimization evolution-strategies
Last synced: 04 Apr 2025
https://github.com/sshkhr/practical_rl
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
bandit-algorithms deep-reinforcement-learning evolutionary-algorithms markov-decision-processes monte-carlo-sampling policy-gradient pytorch reinforcement-learning td-learning tensorflow
Last synced: 11 Apr 2025
https://github.com/sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
bandit-algorithms deep-reinforcement-learning evolutionary-algorithms markov-decision-processes monte-carlo-sampling policy-gradient pytorch reinforcement-learning td-learning tensorflow
Last synced: 05 May 2025
https://github.com/naereen/kullback-leibler-divergences-and-kl-ucb-indexes
🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes
bandit-algorithms cython divergence kl-ucb kullback-leibler-divergence numba python-library
Last synced: 30 Mar 2025
https://github.com/gjjvdburg/thompsonsampling
Source code for blog post on Thompson Sampling
bandit-algorithms multi-armed-bandit multiarmed-bandits thompson-sampling
Last synced: 11 Feb 2025
https://github.com/albertopirillo/ola-project-2023
Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.
bandit-algorithms marketing-automation online-learning reinforcement-learning
Last synced: 22 Apr 2025
https://github.com/naereen/kullbackleibler.jl
💫 Fast Julia implementation of various Kullback-Leibler divergences for 1D parametric distributions. 🏋 Also provides optimized code for kl-UCB indexes
bandit-algorithms divergence julia-package kl-ucb kullback-leibler-divergence
Last synced: 30 Mar 2025
https://github.com/borealisai/raps
Code for the paper "Causal Bandits without Graph Learning"
Last synced: 12 Apr 2025
https://github.com/duruii/replica-aucb
⚙️REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
aucb aution bandit-algorithms bandits cmab mab multi-armed-bandit
Last synced: 12 Mar 2025
https://github.com/niazangels/bandits
An introduction to multi arm bandits
bandit-algorithms multiarm-bandit multiarmed-bandits reinforcement-learning
Last synced: 10 Apr 2025
https://github.com/dkimpara/bandit_oco
Extending Agarwal, Dekel, and Xiao (2010) to the online convex optimization setting with experiments.
bandit-algorithms convex-optimization online-convex-optimization
Last synced: 11 Apr 2025
https://github.com/alextanhongpin/bandit-learn
A knowledge base for Bandit Algorithm
Last synced: 24 Mar 2025
https://github.com/hughrawlinson/bandit-algorithms
🎩🤠Some Bandit Algorithms in Typescript
bandit-algorithms learning optimization
Last synced: 21 Mar 2025