An online minimax optimal algorithm for adversarial multiarmed bandit problem

Gökçesu, Kaan; Kozat, Süleyman Serdar

An online minimax optimal algorithm for adversarial multiarmed bandit problem

Files

An_online_minimax_optimal_algorithm_for_adversarial_multiarmed_bandit_problem.pdf (1.84 MB)

Date

2018

Authors

Gökçesu, Kaan

Kozat, Süleyman Serdar

BUIR Usage Stats

3
views

40
downloads

Citation Stats

Abstract

We investigate the adversarial multiarmed bandit problem and introduce an online algorithm that asymptotically achieves the performance of the best switching bandit arm selection strategy. Our algorithms are truly online such that we do not use the game length or the number of switches of the best arm selection strategy in their constructions. Our results are guaranteed to hold in an individual sequence manner, since we have no statistical assumptions on the bandit arm losses. Our regret bounds, i.e., our performance bounds with respect to the best bandit arm selection strategy, are minimax optimal up to logarithmic terms. We achieve the minimax optimal regret with computational complexity only log-linear in the game length. Thus, our algorithms can be efficiently used in applications involving big data. Through an extensive set of experiments involving synthetic and real data, we demonstrate significant performance gains achieved by the proposed algorithm with respect to the state-of-the-art switching bandit algorithms. We also introduce a general efficiently implementable bandit arm selection framework, which can be adapted to various applications.

Source Title

IEEE Transactions on Neural Networks and Learning Systems

Publisher

Institute of Electrical and Electronics Engineers

Keywords

Adversarial multiarmed bandit, Big data, Individual sequence manner, Minimax optimal, Switching bandit

Permalink

http://hdl.handle.net/11693/50275

Published Version (Please cite this version)

https://doi.org/10.1109/TNNLS.2018.2806006

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article

Full item page

An online minimax optimal algorithm for adversarial multiarmed bandit problem

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

An online minimax optimal algorithm for adversarial multiarmed bandit problem

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type