Multi-Objective Multi-Armed Bandit
Updated Jul 17, 2023 - Python
This repository contains several implementations of multi-armed bandit (MAB) agents applied to a simulated cricket match where an agent selects among different strategies with the goal of maximizing runs while minimizing the risk of getting out.
An implementation of bandit algorithms, focusing on the case of Bernoulli rewards.
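As a rough illustration of the kind of agent this repository describes, here is a minimal sketch of the classic UCB1 algorithm on simulated Bernoulli-reward arms. This is a generic textbook sketch, not the repository's actual code; the class and variable names are hypothetical, and the arm probabilities are made up for the demo.

```python
import math
import random

class UCB1Agent:
    """Minimal UCB1 agent for arms with Bernoulli (0/1) rewards."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms      # pulls per arm
        self.values = [0.0] * n_arms    # empirical mean reward per arm
        self.t = 0                      # total pulls so far

    def select_arm(self):
        self.t += 1
        # Play each arm once before applying the UCB rule.
        for arm, count in enumerate(self.counts):
            if count == 0:
                return arm
        # UCB1: empirical mean plus an exploration bonus that shrinks
        # as an arm is pulled more often.
        return max(
            range(len(self.counts)),
            key=lambda a: self.values[a]
            + math.sqrt(2 * math.log(self.t) / self.counts[a]),
        )

    def update(self, arm, reward):
        # Incremental update of the arm's empirical mean.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]

# Simulate three Bernoulli arms with hidden success probabilities
# (hypothetical values chosen for the demo).
random.seed(0)
probs = [0.2, 0.5, 0.8]
agent = UCB1Agent(len(probs))
for _ in range(5000):
    arm = agent.select_arm()
    reward = 1.0 if random.random() < probs[arm] else 0.0
    agent.update(arm, reward)

print(agent.counts)
```

After enough pulls, the agent should concentrate most of its pulls on the arm with the highest success probability while still occasionally exploring the others.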