
Multi-Armed Bandit

Welcome to the MAB repository. The Multi-Armed Bandit problem is a classic example of decision-making under uncertainty, arising in reinforcement learning, optimization, and many other fields. This repository contains theory notes and Python implementations of classic MAB algorithms, including $\epsilon$-greedy, UCB, Exp3, Exp4, and Thompson Sampling.
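
As a taste of the simplest of these strategies, below is a minimal, self-contained sketch of $\epsilon$-greedy on a Bernoulli bandit. It is illustrative only and not taken from the code in `/algorithms`; the function name, parameters, and arm means are all made up for the example.

```python
import numpy as np

def epsilon_greedy(true_means, epsilon=0.1, horizon=1000, rng=None):
    """Run epsilon-greedy on a Bernoulli bandit with the given arm means.

    Illustrative sketch: names and defaults are assumptions, not the repo's API.
    """
    rng = np.random.default_rng() if rng is None else rng
    k = len(true_means)
    counts = np.zeros(k)       # number of pulls per arm
    estimates = np.zeros(k)    # empirical mean reward per arm
    total_reward = 0.0

    for _ in range(horizon):
        if rng.random() < epsilon:
            arm = int(rng.integers(k))        # explore: pick a uniformly random arm
        else:
            arm = int(np.argmax(estimates))   # exploit: pick the best empirical arm
        reward = float(rng.random() < true_means[arm])  # Bernoulli reward draw
        counts[arm] += 1
        # incremental update of the empirical mean for the pulled arm
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return total_reward, estimates

# Example: 3-armed Bernoulli bandit with means 0.2, 0.5, 0.8
reward, est = epsilon_greedy([0.2, 0.5, 0.8], epsilon=0.1, horizon=5000)
print(reward, est)
```

With a small $\epsilon$, most pulls go to the empirically best arm while occasional random pulls keep the estimates of the other arms from going stale; the other algorithms in this repo (UCB, Exp3, Exp4, Thompson Sampling) replace this fixed exploration rate with more principled exploration schemes.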

Repository Structure

  • /algorithms - Contains implementations of various MAB algorithms in Python.
  • /slides - Contains presentation slides for each topic.
  • /theory - Includes theoretical explanations and derivations related to MAB problems and algorithms.
  • /project - Contains the capstone project.
